Abstract — This paper considers the non-cooperative maximization
of mutual information in the vector Gaussian interference
channel in a fully distributed fashion via game theory. This
problem has been widely studied in a number of works during
the past decade for frequency-selective channels, and recently
for the more general MIMO case, for which the state-of-the-art
results are valid only for nonsingular square channel matrices.
Surprisingly, these results do not hold true when the channel
matrices are rectangular and/or rank deficient.

The goal of this paper is to provide a complete characterization 
of the MIMO game for arbitrary channel matrices, in terms 
of conditions guaranteeing both the uniqueness of the Nash 
equilibrium and the convergence of asynchronous distributed 
iterative waterfilling algorithms. Our analysis hinges on new 
technical intermediate results, such as a new expression for the 
MIMO waterfilling projection valid (also) for singular matrices, 
a mean-value theorem for complex matrix-valued functions, 
and a general contraction theorem for the multiuser MIMO 
waterfilling mapping valid for arbitrary channel matrices. The 
quite surprising result is that uniqueness/convergence conditions 
in the case of tall (possibly singular) channel matrices are more 
restrictive than those required in the case of (full rank) fat 
channel matrices. 

We also propose a modified game and algorithm with milder 
conditions for the uniqueness of the equilibrium and convergence, 
and virtually the same performance (in terms of Nash equilibria) 
of the original game. 

Index Terms — Game Theory, MIMO Gaussian interference 
channel, Nash equilibrium, totally asynchronous algorithms, 
waterfilling. 

I. Introduction 

THE interference channel is a mathematical model relevant 
to many communication systems where multiple uncoor- 
dinated links share a common communication medium (see, 
e.g., [1]-[3]). In this paper, we focus on the MIMO Gaussian
interference channel and consider the non-cooperative
maximization of mutual information on each link in a fully 
distributed fashion using a game theoretical approach. The 
system is modeled as a (strategic non-cooperative) game where 
every MIMO link is a player who competes against the 
others by choosing his transmit covariance matrix (transmis- 
sion strategy) to maximize his own information rate (payoff 
function), given constraints on the transmit power and treating 
the interference generated by the other users as additive noise 
(implying that multiuser encoding/decoding and interference 
cancellation techniques are not considered). Based on the 
celebrated notion of Nash Equilibrium (NE) in game theory
(see, e.g., [4], [5]), an equilibrium for the whole system is
reached when every player's reaction is "unilaterally optimal", 
i.e., when, given the rival players' current strategies, any 
change in a player's own strategy would result in a rate loss. 

Since the seminal paper by Yu et al. [6] in 2002 (and the 
conference version in 2001), this problem has been studied 
in a number of works during the past seven years for the 
case of SISO frequency-selective channels or, equivalently, a 
set of parallel non-interfering scalar channels fTl- lfTSl . Several 
sufficient conditions have been derived that guarantee the 
uniqueness of the NE and the convergence of different dis- 
tributed waterfilling based algorithms: synchronous sequential 
pel-fOl, nil, US, Ha, synchronous simultaneous lfTUl-lfT3l. 
[15] and asynchronous [12]. See [16] for a unified view of the 
state-of-the-art results in SISO frequency-selective interference 
channels. 

The more general MIMO case, as considered in this paper,
is a nontrivial extension of the SISO frequency-selective
case. Indeed, only a few papers have studied
(special cases of) the MIMO game [17]-[21]. In [17], the
authors focused on the two-user MISO flat-fading channel.
In [18]-[20], the authors considered the rate maximization
game in MIMO interference channels, but they provided only
numerical results to support the existence of a NE of the
game. Furthermore, these papers study neither the
uniqueness of the equilibrium nor the convergence of the proposed
algorithms. In [21], the authors showed that the MIMO rate
maximization game is a concave game (in the sense of [22]),
implying the existence of a NE for any set of arbitrary channel
matrices [22, Theorem 1]. As far as the uniqueness of the
equilibrium is concerned, only [21] showed that if the channel
matrices are full column-rank and the multiuser interference is
almost negligible, then the NE is unique, without quantifying
how small the interference must be. Finally, in [16], the
authors provided sufficient conditions that, differently from
[21], explicitly quantify how strong the multiuser interference
can be to guarantee both the uniqueness of the equilibrium
and the convergence of distributed algorithms. However, the
results in [16] are valid only for the MIMO game with square
nonsingular channel matrices. Hence, the study of the game
in the case of arbitrary channel matrices has not been done
to date.

The goal of this paper is to fill this gap and provide
a complete analysis of the MIMO game, without making
any restrictive assumption on the structure of the channel
matrices, so that they can be rectangular and possibly rank
deficient. Interestingly, the study of the game in this more
general setup cannot be obtained by a trivial extension of the
framework developed in [16], since the key results used in [16]
to derive conditions for both the uniqueness of the NE and
the convergence of distributed algorithms (the interpretation
of the MIMO waterfilling solution as a projection of a proper
nonsingular matrix and the contraction theorem of the MIMO
multiuser waterfilling mapping) fail to hold in the more general
case of rectangular and possibly rank deficient channel
matrices. Thus, the study of the game hinges on new technical
intermediate results, such as a new expression for the MIMO
waterfilling projection valid (also) for singular matrices and
a mean-value theorem for complex matrix-valued functions,
which will play a key role in deriving one of the main results
of the paper: a general contraction theorem for the multiuser
MIMO waterfilling mapping that is valid for arbitrary channel
matrices. A quite surprising result follows from this new theorem:
the sufficient conditions guaranteeing both the uniqueness
of the fixed point of the MIMO waterfilling mapping (the
Nash equilibrium of the MIMO game) and the convergence of
totally asynchronous iterative waterfilling algorithms (IWFA)
in the case of (strictly) tall and/or singular channel matrices need
to be stronger than those required for the case of full row-rank
(and thus also square) channel matrices.

To provide a weaker set of uniqueness and convergence 
conditions for both cases of tall and fat channel matrices, we 
propose a new best-response strategy for each player based on 
a modified game. The resulting new game admits a unique NE 
under a unified set of sufficient conditions that are valid for 
arbitrary channel matrices and coincide with the weakest ones 
obtained for the original game (i.e., those obtained in the case 
of full row-rank channel matrices). Such conditions are also 
sufficient for the convergence of an asynchronous distributed 
algorithm based on this new mapping. Numerical experiments 
show that the performance of both games is virtually the same. 

The paper is organized as follows. Section II gives the
system model and formulates the optimization problem as a
strategic non-cooperative game. In Section III, we derive the
main properties of the multiuser MIMO waterfilling mapping:
a new expression of the MIMO waterfilling solution as a
matrix projection and the contraction theorem for the multiuser
MIMO waterfilling mapping, both valid for arbitrary
channel matrices. The contraction property of the multiuser
waterfilling paves the way to derive sufficient conditions
guaranteeing the uniqueness of the fixed point of the waterfilling
projector (alias the NE of the MIMO game) and the
convergence of iterative, possibly asynchronous, distributed
algorithms, as detailed in Sections IV and V, respectively.
Section VI provides some numerical results validating our
theoretical findings, and Section VII draws some conclusions.

The following notation is used in the paper. Uppercase
and lowercase boldface denote matrices and vectors, respectively.
The operators $(\cdot)^*$, $(\cdot)^H$, $(\cdot)^{\sharp}$, $\mathbb{E}\{\cdot\}$, $\mathrm{vec}(\cdot)$ and $\mathrm{Tr}(\cdot)$
are conjugate, Hermitian, Moore-Penrose pseudoinverse [27],
expectation, stacking vectorization operator and trace operator,
respectively. The range space and null space operators are
denoted by $\mathcal{R}(\cdot)$ and $\mathcal{N}(\cdot)$, respectively. The operators $\le$ and
$\ge$ for vectors and matrices are defined component-wise, while
$\mathbf{A} \succeq \mathbf{B}$ (or $\mathbf{A} \preceq \mathbf{B}$) means that $\mathbf{A} - \mathbf{B}$ is positive (or
negative) semidefinite. The operator $\mathrm{Diag}(\cdot)$ is the diagonal
matrix with the same diagonal elements as the matrix (or
vector) argument, $[\cdot]_{ij}$ denotes the $(i,j)$ element of the matrix
argument, $\otimes$ denotes the Kronecker product operator [26],
$\oplus$ denotes the direct sum [26], and $(x)^+ \triangleq \max(0,x)$.
The spectral radius¹ of a matrix $\mathbf{A}$ is denoted by $\rho(\mathbf{A})$.
The operator $\mathbf{P}_{\mathcal{N}(\mathbf{A})}$ (or $\mathbf{P}_{\mathcal{R}(\mathbf{A})}$) denotes the orthogonal
projection onto the null space (or the range space) of matrix
$\mathbf{A}$ and it is given by $\mathbf{P}_{\mathcal{N}(\mathbf{A})} = \mathbf{N}_A(\mathbf{N}_A^H\mathbf{N}_A)^{-1}\mathbf{N}_A^H$ (or
$\mathbf{P}_{\mathcal{R}(\mathbf{A})} = \mathbf{R}_A(\mathbf{R}_A^H\mathbf{R}_A)^{-1}\mathbf{R}_A^H$), where $\mathbf{N}_A$ (or $\mathbf{R}_A$) is any
(strictly tall) matrix whose columns are linearly independent
vectors spanning $\mathcal{N}(\mathbf{A})$ (or $\mathcal{R}(\mathbf{A})$) [29], [30]. The operator
$[\mathbf{X}]_{\mathcal{Q}} \triangleq \arg\min_{\mathbf{Z}\in\mathcal{Q}} \|\mathbf{Z}-\mathbf{X}\|_F$ denotes the matrix projection
with respect to the Frobenius norm² of matrix $\mathbf{X}$ onto the
(convex) set $\mathcal{Q}$. The sets $\mathbb{C}$, $\mathbb{R}$, $\mathbb{R}_+$, $\mathbb{R}_{++}$, $\mathbb{N}_+$, and $\mathbb{S}^n_{+}$
(or $\mathbb{S}^n_{++}$) stand for the set of complex, real, nonnegative real,
positive real, nonnegative integer numbers, and $n \times n$ complex
positive semidefinite (or definite) matrices, respectively.

II. The Rate Maximization Game 

We consider a vector Gaussian interference channel composed
of $Q$ MIMO links. The transmission over the generic
$q$-th MIMO channel with $n_{T_q}$ transmit and $n_{R_q}$ receive
dimensions can be described by the baseband signal model

$$\mathbf{y}_q = \mathbf{H}_{qq}\mathbf{x}_q + \sum_{r\neq q}\mathbf{H}_{rq}\mathbf{x}_r + \mathbf{n}_q, \tag{1}$$

where $\mathbf{x}_q \in \mathbb{C}^{n_{T_q}}$ is the vector transmitted by source
$q$, $\mathbf{H}_{qq} \in \mathbb{C}^{n_{R_q}\times n_{T_q}}$ is the direct channel of link $q$,
$\mathbf{H}_{rq} \in \mathbb{C}^{n_{R_q}\times n_{T_r}}$ is the cross-channel matrix between source
$r$ and destination $q$, $\mathbf{y}_q \in \mathbb{C}^{n_{R_q}}$ is the vector received by
destination $q$, and $\mathbf{n}_q \in \mathbb{C}^{n_{R_q}}$ is a zero-mean circularly symmetric
complex Gaussian noise vector with arbitrary covariance matrix
$\mathbf{R}_{n_q}$ (assumed to be nonsingular). The second term on the
right-hand side of (1) represents the Multi-User Interference
(MUI) received by the $q$-th destination and caused by the other
active links. For each transmitter $q$, the total average transmit
power is

$$\mathbb{E}\{\|\mathbf{x}_q\|_2^2\} = \mathrm{Tr}(\mathbf{Q}_q) \le P_q, \tag{2}$$

where $\mathbf{Q}_q \triangleq \mathbb{E}\{\mathbf{x}_q\mathbf{x}_q^H\}$ is the covariance matrix of the
transmitted vector $\mathbf{x}_q$, and $P_q$ is the maximum average transmitted
power in units of energy per transmission.

Since our goal is to find distributed algorithms that require 
neither a centralized control nor coordination among the links, 
we focus on transmission techniques where no interference 
cancellation is performed and the MUI is treated as additive 
colored noise from each receiver. Each channel is assumed to 
change sufficiently slowly to be considered fixed during the 
whole transmission, so that the information theoretical results 
are meaningful. Moreover, perfect channel state information at
both transmitter and receiver sides of each link is assumed;³
each receiver is also assumed to measure with no errors the
covariance matrix of the noise plus MUI generated by the
other users.

¹ The spectral radius $\rho(\mathbf{A})$ of the matrix $\mathbf{A}$ is defined as $\rho(\mathbf{A}) \triangleq
\max\{|\lambda| : \lambda \in \sigma(\mathbf{A})\}$, with $\sigma(\mathbf{A})$ denoting the spectrum of $\mathbf{A}$ [26].

² The Frobenius norm $\|\mathbf{X}\|_F$ of $\mathbf{X}$ is defined as $\|\mathbf{X}\|_F \triangleq (\mathrm{Tr}(\mathbf{X}^H\mathbf{X}))^{1/2}$
[26].

³ Note that each user $q$ is only required to know his own channel $\mathbf{H}_{qq}$,
but not the channels $\{\mathbf{H}_{rq}\}_{r\neq q}$ of the others.

Under these assumptions, invoking the capacity expression
for the single user Gaussian MIMO channel (achievable using
random Gaussian codes by all the users), the maximum
information rate on link $q$ for a given set of users' covariance
matrices $\mathbf{Q}_1, \ldots, \mathbf{Q}_Q$ is [24]

$$R_q(\mathbf{Q}_q, \mathbf{Q}_{-q}) = \log\det\left(\mathbf{I} + \mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}\mathbf{Q}_q\right), \tag{3}$$

where

$$\mathbf{R}_{-q}(\mathbf{Q}_{-q}) \triangleq \mathbf{R}_{n_q} + \sum_{r\neq q}\mathbf{H}_{rq}\mathbf{Q}_r\mathbf{H}_{rq}^H \tag{4}$$

is the MUI-plus-noise covariance matrix observed by user $q$, and
$\mathbf{Q}_{-q} \triangleq (\mathbf{Q}_r)_{r\neq q}$ is the set of all users' covariance matrices,
except the $q$-th one.
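
As a rough illustration of (3) and (4), the following NumPy sketch evaluates the rate of one link for given covariance matrices. The function names, the nested-list layout `H[r][q]` for the channel from source r to destination q, and the use of the natural logarithm (rates in nats) are assumptions made here for illustration; they are not taken from the paper.

```python
import numpy as np

def interference_plus_noise_cov(q, H, Qs, R_n):
    """MUI-plus-noise covariance of eq. (4) for link q (0-based index)."""
    R = np.array(R_n[q], dtype=complex)
    for r in range(len(Qs)):
        if r != q:
            # H[r][q] is assumed to hold the cross-channel from source r to destination q.
            R = R + H[r][q] @ Qs[r] @ H[r][q].conj().T
    return R

def rate(q, H, Qs, R_n):
    """Rate of eq. (3) in nats: log det(I + H_qq^H R_{-q}^{-1} H_qq Q_q)."""
    R = interference_plus_noise_cov(q, H, Qs, R_n)
    H_qq = H[q][q]
    M = np.eye(H_qq.shape[1]) + H_qq.conj().T @ np.linalg.solve(R, H_qq) @ Qs[q]
    _, logdet = np.linalg.slogdet(M)   # numerically safer than log(det(M))
    return float(logdet)

# Scalar two-link toy example: interference power 3 plus unit noise gives R = 4,
# so the rate of link 0 is log(1 + 2*2*1/4) = log(2).
H_toy = [[np.array([[2.0]]), np.array([[1.0]])],
         [np.array([[1.0]]), np.array([[2.0]])]]
Q_toy = [np.array([[1.0]]), np.array([[3.0]])]
R_n_toy = [np.eye(1), np.eye(1)]
rate_link0 = rate(0, H_toy, Q_toy, R_n_toy)
```

Solving the linear system with `np.linalg.solve` avoids forming the inverse of the interference-plus-noise covariance explicitly.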

Given the above setup, we formulate the system design
within the framework of game theory using as desirability
criterion the concept of Nash equilibrium (NE) (see, e.g.,
[4], [5]). Specifically, we consider the following strategic
non-cooperative game:

$$\begin{array}{ll} \underset{\mathbf{Q}_q}{\text{maximize}} & R_q(\mathbf{Q}_q, \mathbf{Q}_{-q}) \\ \text{subject to} & \mathbf{Q}_q \in \mathcal{Q}_q, \end{array} \qquad \forall q \in \Omega, \tag{5}$$

where $\Omega \triangleq \{1, \ldots, Q\}$ is the set of players (i.e., the links),
$R_q(\mathbf{Q}_q, \mathbf{Q}_{-q})$ is the payoff function of player $q$ defined in
(3), and $\mathcal{Q}_q$ is the set of admissible strategies (the covariance
matrices) of player $q$, defined as⁴

$$\mathcal{Q}_q \triangleq \left\{\mathbf{Q} \in \mathbb{C}^{n_{T_q}\times n_{T_q}} : \mathbf{Q} \succeq \mathbf{0},\ \mathrm{Tr}\{\mathbf{Q}\} = P_q\right\}. \tag{6}$$

In game $\mathcal{G}$, each player $q$ competes against the others by
choosing the transmit covariance matrix $\mathbf{Q}_q$ (i.e., his strategy)
that maximizes his own information rate $R_q(\mathbf{Q}_q, \mathbf{Q}_{-q})$, as
given in (3), subject to the average transmit power constraint
in (6). A solution of the game, a NE, is reached when each
user, given the strategy profiles of the others, does not get any
rate increase by unilaterally changing his own strategy (see,
e.g., [4], [5], [16]).

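To write the Nash equilibria of game $\mathcal{G}$ in a convenient
form, we first introduce the following notations and definitions.
Given $\mathcal{G}$ and $\mathbf{Q}_{-q} \in \mathcal{Q}_{-q} \triangleq \mathcal{Q}_1 \times \cdots \times \mathcal{Q}_{q-1} \times
\mathcal{Q}_{q+1} \times \cdots \times \mathcal{Q}_Q$, we write the eigendecomposition of
$\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}$ for each $q \in \Omega$ as:

$$\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq} = \mathbf{U}_q\mathbf{D}_q\mathbf{U}_q^H, \tag{7}$$

where $\mathbf{U}_q = \mathbf{U}_q(\mathbf{Q}_{-q}) \in \mathbb{C}^{n_{T_q}\times r_q}$ is a (semi-)unitary matrix
with the eigenvectors, $\mathbf{D}_q = \mathbf{D}_q(\mathbf{Q}_{-q}) \in \mathbb{R}_{++}^{r_q\times r_q}$ is a diagonal
matrix with $r_q \triangleq \mathrm{rank}(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}) = \mathrm{rank}(\mathbf{H}_{qq})$
positive eigenvalues, and $\mathbf{R}_{-q}(\mathbf{Q}_{-q})$ is defined in (4).

Given $q \in \Omega$ and $\mathbf{Q}_{-q} \in \mathcal{Q}_{-q}$, the solution to the single-user
optimization problem in (5) is the well-known waterfilling
solution (e.g., [24]):

$$\mathbf{Q}_q^\star = \mathrm{WF}_q(\mathbf{Q}_{-q}), \tag{8}$$

with the waterfilling operator $\mathrm{WF}_q(\cdot)$ defined as

$$\mathrm{WF}_q(\mathbf{Q}_{-q}) \triangleq \mathbf{U}_q\left(\mu_q\mathbf{I} - \mathbf{D}_q^{-1}\right)^+\mathbf{U}_q^H, \tag{9}$$

where $\mathbf{U}_q = \mathbf{U}_q(\mathbf{Q}_{-q})$ and $\mathbf{D}_q = \mathbf{D}_q(\mathbf{Q}_{-q})$ are the eigenvector
matrix and the diagonal matrix of the eigenvalues in (7), and $\mu_q$ is
the water-level chosen to satisfy $\mathrm{Tr}\{(\mu_q\mathbf{I} - \mathbf{D}_q^{-1})^+\} = P_q$.

⁴ Observe that, in the definition of $\mathcal{Q}_q$ in (6), we omitted the redundant
condition $\mathbf{Q} = \mathbf{Q}^H$, since any complex positive semidefinite matrix must be
necessarily Hermitian [26, Sec. 7.1]. Furthermore, there is no loss of generality
in considering in (6) the power constraint with equality rather than inequality,
since at the optimum to each problem in (5), the constraint must be satisfied
with equality.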
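
The waterfilling operator in (9) can be sketched numerically as follows. This is a minimal illustration under stated assumptions: the function name `mimo_waterfilling`, the rank tolerance `tol`, and the sorting-based search for the water-level are choices made here, not the paper's implementation. The code keeps only the strictly positive eigenvalues of the equivalent channel, so it also accepts rectangular or rank-deficient direct channels.

```python
import numpy as np

def mimo_waterfilling(H_qq, R_mq, P_q, tol=1e-10):
    """Sketch of WF_q in eq. (9): U (mu*I - D^{-1})^+ U^H with trace equal to P_q.

    H_qq : direct channel (n_R x n_T), possibly rectangular or rank deficient.
    R_mq : MUI-plus-noise covariance R_{-q}(Q_{-q}), assumed positive definite.
    """
    # Equivalent channel H^H R^{-1} H of eq. (7), symmetrized against round-off.
    A = H_qq.conj().T @ np.linalg.solve(R_mq, H_qq)
    A = (A + A.conj().T) / 2
    eigval, eigvec = np.linalg.eigh(A)
    keep = eigval > tol * max(eigval.max(), 1.0)   # the r_q positive eigenvalues
    if not np.any(keep):
        raise ValueError("equivalent channel is numerically zero")
    d, U = eigval[keep], eigvec[:, keep]

    # Water-level: try k = r_q, r_q - 1, ... active modes (smallest 1/d first)
    # until the level mu exceeds the k-th smallest inverse eigenvalue.
    inv_d = 1.0 / d
    ordered = np.sort(inv_d)
    for k in range(len(ordered), 0, -1):
        mu = (P_q + ordered[:k].sum()) / k
        if mu > ordered[k - 1]:
            break
    powers = np.maximum(mu - inv_d, 0.0)
    return (U * powers) @ U.conj().T               # scales the columns of U

# Toy check: H = diag(2, 1), white noise, P = 1 gives powers (0.875, 0.125).
Q_toy = mimo_waterfilling(np.diag([2.0, 1.0]), np.eye(2), 1.0)
```

With a smaller budget (for example `P_q = 0.5` in the same toy setting) the weaker mode receives no power, which is the usual behavior of the $(\cdot)^+$ operation in (9).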

Using (9), we can now characterize the Nash equilibria of
the game $\mathcal{G}$ in a compact way as the following waterfilling
fixed-point equation:

$$\mathbf{Q}_q^\star = \mathrm{WF}_q(\mathbf{Q}_{-q}^\star), \qquad \forall q \in \Omega. \tag{10}$$

Remark 1 - Related works: The matrix nature of game $\mathcal{G}$
and the arbitrary structure of the channel matrices make the
analysis of the game quite complicated, and none of the results
in the literature [6]-[13] and [21], [16] can be successfully
applied to $\mathcal{G}$. The main difficulty in the analysis of the solution
in (10) comes from the fact that the optimal eigenvector
matrix $\mathbf{U}_q^\star = \mathbf{U}_q(\mathbf{Q}_{-q}^\star)$ of each user $q$ depends, in general,
on the strategies $\mathbf{Q}_{-q}^\star$ of all the other users, through a very
complicated implicit relationship: the eigendecomposition of
the equivalent channel matrix $\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q}^\star)\mathbf{H}_{qq}$.

The goal of this paper is to provide a complete analysis of
game $\mathcal{G}$ without making any assumption on the structure of
the channel matrices. Surprisingly, in the case of rectangular
channel matrices, most of the key results given in [16],⁵ such
as the interpretation of the MIMO multiuser waterfilling as a
matrix projection onto a convex set [16, Lemma 6] and the
contraction theorem for the multiuser MIMO waterfilling
operator [16, Th. 5], fail to hold, implying that a new framework
is needed for the more general case of rectangular, possibly
singular, channel matrices. We focus on this issue in the next
section.

III. Properties of the MIMO Waterfilling Mapping 

In this section we derive some interesting properties of the
multiuser MIMO waterfilling mapping. These results will be
instrumental to study the uniqueness of the NE of game $\mathcal{G}$
and to derive conditions for the convergence of the algorithms
proposed in Section V. Differently from previous works in
the literature [6]-[16], where only channel matrices with
special structure (nonsingular diagonal [6]-[13], nonsingular
Toeplitz circulant [14], [15], and nonsingular MIMO square
[16] matrices) were considered, here we do not make any
assumption on the structure of the channel matrices, which
can be rectangular and (possibly) rank deficient.
The main result of the section is a contraction theorem for
the multiuser MIMO waterfilling mapping, valid for arbitrary
channel matrices.

A. MIMO waterfilling as a projector 

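In [15], we showed that the waterfilling operator for SISO
frequency-selective channels can be interpreted as a projection
onto a simplex set. This gave us a key tool to prove the
convergence of the iterative waterfilling based algorithms
in the multiuser case. This interpretation can be naturally
generalized to the MIMO case for square and nonsingular
channel matrices [16, Lemma 6]: given the nonsingular
matrix $\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}$, the MIMO waterfilling operator
$\mathrm{WF}_q(\mathbf{Q}_{-q})$ in (9) can be equivalently written as

$$\mathrm{WF}_q(\mathbf{Q}_{-q}) = \left[-\left(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}\right)^{-1}\right]_{\mathcal{Q}_q}, \tag{11}$$

where $\mathcal{Q}_q$ is defined in (6).

⁵ Note that, because of the space limitation, most of the results in [16]
dealing with the MIMO case were given without proof.

Negative result #1: One could conjecture that the more general
case where the channel matrix $\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}$ is singular
follows naturally from (11) by replacing the inverse with some
generalized inverse [27]. Unfortunately, this conclusion is not
true. In the general case of possibly singular channel matrices,
the equivalent expression of the MIMO waterfilling operator
as a projection contains an additional term (the orthogonal
projection matrix onto the null space of the direct channel
matrix), as stated next.

Lemma 1: The MIMO waterfilling operator $\mathrm{WF}_q(\mathbf{Q}_{-q})$ in
(9) can be equivalently written as

$$\mathrm{WF}_q(\mathbf{Q}_{-q}) = \left[-\left(\left(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}\right)^{\sharp} + c_q\mathbf{P}_{\mathcal{N}(\mathbf{H}_{qq})}\right)\right]_{\mathcal{Q}_q}, \tag{12}$$

where $c_q$ is a positive constant satisfying $c_q > c_q(\mathbf{Q}_{-q}) \triangleq
P_q + \max_{i\in\{1,\ldots,r_q\}} [\mathbf{D}_q(\mathbf{Q}_{-q})]_{ii}^{-1}$, and $\mathcal{Q}_q$ is defined in (6).
An upper bound of $c_q(\mathbf{Q}_{-q})$ independent of $\mathbf{Q}_{-q}$ is given in
(93) in Appendix A.

Proof: See Appendix A. ■

Observe that, for each $q \in \Omega$, $\mathbf{P}_{\mathcal{N}(\mathbf{H}_{qq})}$ in (12) depends
only on the channel matrix $\mathbf{H}_{qq}$ (through the right singular
vectors of $\mathbf{H}_{qq}$ corresponding to the zero singular values) and
not on the strategies of the other users, since $\mathbf{R}_{-q}(\mathbf{Q}_{-q})$ is
positive definite for all $\mathbf{Q}_{-q} \in \mathcal{Q}_{-q}$.

Remark 2 - Special cases: Since the expression in (12) is
valid for any arbitrary set of channel matrices, it contains as
special cases previous results obtained in [15] and [16] when
the channel matrices have a particular structure. For example,
since $\mathbf{P}_{\mathcal{N}(\mathbf{H}_{qq})} = \mathbf{0}$ if and only if $\mathbf{H}_{qq}$ is full column-rank,
in the case of square nonsingular channel $\mathbf{H}_{qq}$, the MIMO
waterfilling projection in (12) coincides with (11) [16].

Non-expansive property of the waterfilling operator: Thanks
to the interpretation of the MIMO waterfilling in (9) as a
projector, one can obtain the following non-expansive property
of the waterfilling operator [16, Lemma 7].

Lemma 2: The matrix projection $[\cdot]_{\mathcal{Q}_q}$ onto the convex
set $\mathcal{Q}_q$ defined in (6) satisfies the following non-expansive
property:

$$\left\|[\mathbf{X}]_{\mathcal{Q}_q} - [\mathbf{Y}]_{\mathcal{Q}_q}\right\|_F \le \|\mathbf{X} - \mathbf{Y}\|_F, \qquad \forall\, \mathbf{X}, \mathbf{Y} \in \mathbb{C}^{n_{T_q}\times n_{T_q}}. \tag{13}$$

□

Nash equilibria as fixed-points of the WF projection: Using
(12) in (10), it is straightforward to see that all the Nash
equilibria of game $\mathcal{G}$ can be alternatively obtained as the
fixed-points of the mapping defined in (12):

$$\mathbf{Q}_q^\star = \left[-\left(\left(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q}^\star)\mathbf{H}_{qq}\right)^{\sharp} + c_q\mathbf{P}_{\mathcal{N}(\mathbf{H}_{qq})}\right)\right]_{\mathcal{Q}_q}, \qquad \forall q \in \Omega, \tag{14}$$

for sufficiently large (positive) $c_q$. This equivalent expression
of the Nash equilibria, along with the non-expansive property
of the multiuser waterfilling, will play a key role in the
study of uniqueness of the equilibrium and convergence of
the distributed iterative algorithms proposed in Section V.

B. Contraction properties of the multiuser MIMO waterfilling
mapping

Building on the interpretation of the waterfilling operator
as a projector, we now provide one of the main results of
the paper: the contraction property of the MIMO multiuser
waterfilling mapping for arbitrary channel matrices. To this
end, we first introduce some basic definitions and results that
will be used in our derivations.

1) Intermediate definitions and miscellaneous results:
Given the multiuser waterfilling mapping

$$\mathrm{WF}(\mathbf{Q}) \triangleq (\mathrm{WF}_q(\mathbf{Q}_{-q}))_{q\in\Omega} : \mathcal{Q} \mapsto \mathcal{Q}, \tag{15}$$

where $\mathcal{Q} \triangleq \mathcal{Q}_1 \times \cdots \times \mathcal{Q}_Q$, and $\mathcal{Q}_q$ and $\mathrm{WF}_q(\mathbf{Q}_{-q})$ are defined
in (6) and (12), respectively, we introduce the following
block-maximum norm on $\mathbb{C}^{n\times n}$, with $n \triangleq n_{T_1} + \ldots + n_{T_Q}$, defined
as

$$\|\mathrm{WF}(\mathbf{Q})\|_{F,\mathrm{block}}^{\mathbf{w}} \triangleq \max_{q\in\Omega} \frac{\|\mathrm{WF}_q(\mathbf{Q}_{-q})\|_F}{w_q}, \tag{16}$$

where $\|\cdot\|_F$ is the Frobenius norm and $\mathbf{w} \triangleq [w_1, \ldots, w_Q]^T > \mathbf{0}$
is any positive weight vector. Let $\|\cdot\|_{\infty,\mathrm{vec}}^{\mathbf{w}}$ be the vector
weighted maximum norm, defined as

$$\|\mathbf{x}\|_{\infty,\mathrm{vec}}^{\mathbf{w}} \triangleq \max_{q\in\Omega} \frac{|x_q|}{w_q}, \qquad \text{for } \mathbf{w} > \mathbf{0},\ \mathbf{x} \in \mathbb{R}^{Q}, \tag{17}$$

and let $\|\cdot\|_{\infty,\mathrm{mat}}^{\mathbf{w}}$ denote the matrix norm induced by $\|\cdot\|_{\infty,\mathrm{vec}}^{\mathbf{w}}$,
given by [26]

$$\|\mathbf{A}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} \triangleq \max_{q} \frac{1}{w_q}\sum_{r=1}^{Q} |[\mathbf{A}]_{qr}|\, w_r, \qquad \text{for } \mathbf{A} \in \mathbb{R}^{Q\times Q}. \tag{18}$$

Given the set $\mathcal{P} \triangleq (\mathbf{P}_{rq})_{r\neq q}$, with each $\mathbf{P}_{rq} \in \mathbb{C}^{n_{R_q}\times n_{R_q}}$,
we introduce the nonnegative matrix $\mathbf{S}(\mathcal{P}) \in \mathbb{R}_+^{Q\times Q}$ defined
as

$$[\mathbf{S}(\mathcal{P})]_{qr} \triangleq \begin{cases} \rho\left(\mathbf{H}_{rq}^H\mathbf{P}_{rq}^H\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{P}_{rq}\mathbf{H}_{rq}\right), & \text{if } r \neq q, \\ 0, & \text{otherwise.} \end{cases} \tag{19}$$

For the sake of notation, we use the following convention:
when each $\mathbf{P}_{rq} = \mathbf{I}$ (i.e., $\mathcal{P} = (\mathbf{I})$), then we denote $\mathbf{S}((\mathbf{I}))$
by $\mathbf{S}$, i.e.,

$$[\mathbf{S}]_{qr} \triangleq [\mathbf{S}((\mathbf{I}))]_{qr} = \begin{cases} \rho\left(\mathbf{H}_{rq}^H\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{H}_{rq}\right), & \text{if } r \neq q, \\ 0, & \text{otherwise.} \end{cases} \tag{20}$$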
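Finally, given the following definition of interference-plus-noise
to noise ratio:

$$\mathrm{innr}_q \triangleq \frac{\rho\left(\mathbf{R}_{n_q} + \sum_{r\neq q} P_r\mathbf{H}_{rq}\mathbf{H}_{rq}^H\right)}{\lambda_{\min}(\mathbf{R}_{n_q})} \ge 1, \qquad q \in \Omega, \tag{21}$$

let us define $\mathbf{S}^{\mathrm{up}} \in \mathbb{R}_+^{Q\times Q}$ as

$$[\mathbf{S}^{\mathrm{up}}]_{qr} \triangleq \begin{cases} \mathrm{innr}_q \cdot \rho\left(\mathbf{H}_{rq}^H\mathbf{H}_{rq}\right)\rho\left(\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\right), & \text{if } r \neq q, \\ 0, & \text{otherwise.} \end{cases} \tag{22}$$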

We now provide a couple of lemmas that will be used in the
forthcoming derivations (we omit the proofs because of
space limitations).

Lemma 3 (Reverse order law for Moore-Penrose pseudoinverses):
Let $\mathbf{H} \in \mathbb{C}^{m\times n}$ with $\mathrm{rank}(\mathbf{H}) = m$ (i.e., full rank fat
matrix), and $\mathbf{R} \in \mathbb{C}^{m\times m}$ with $\mathrm{rank}(\mathbf{R}) = m$. Then,

$$\left(\mathbf{H}^H\mathbf{R}\mathbf{H}\right)^{\sharp} = \mathbf{H}^{\sharp}\mathbf{R}^{-1}\mathbf{H}^{\sharp H}. \tag{23}$$

□

Interestingly, a generalization of Lemma 3 that is valid also
for generalized inverses (cf. [27]) of the product of matrices
can be found, e.g., in [28, Th. 2.2], under some conditions.



Negative result #2: In the case of (strictly) full column-rank
matrix $\mathbf{H}$, the reverse order law in (23) is not satisfied anymore
(the necessary and sufficient conditions given in [28, Th. 2.2]
are not satisfied). In fact, in such a case, invoking the matrix
version of the Kantorovich inequality [31, Ch. 11], one can
prove that the following relationship exists between the LHS
and RHS in (23):

$$\left(\mathbf{H}^H\mathbf{R}\mathbf{H}\right)^{\sharp} \preceq \mathbf{H}^{\sharp}\mathbf{R}^{-1}\mathbf{H}^{\sharp H}. \tag{24}$$

This undesired result is one of the main reasons why the
derivations obtained in [16], under the assumption of square
nonsingular channel matrices (for which Lemma 3 holds true),
cannot be generalized to rectangular and/or rank deficient
matrices. We thus need new results, as given in Section III-B3.
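
The contrast between the reverse order law (23) and the inequality (24) can be checked numerically. The sketch below draws a random fat and a random tall complex matrix together with random positive definite matrices; the helper names, the fixed seed and the cutoff passed to `np.linalg.pinv` are assumptions made for this illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_complex(m, n):
    """Random m x n matrix with i.i.d. complex Gaussian entries."""
    return rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

def random_pd(n):
    """Random Hermitian positive definite n x n matrix."""
    B = random_complex(n, n)
    return B @ B.conj().T + n * np.eye(n)

def reverse_order_gap(H, R):
    """RHS minus LHS of (23)/(24): H^# R^{-1} H^{#H} - (H^H R H)^#."""
    H_pinv = np.linalg.pinv(H)
    # Explicit cutoff so the numerically zero singular values are discarded.
    lhs = np.linalg.pinv(H.conj().T @ R @ H, 1e-10)
    rhs = H_pinv @ np.linalg.inv(R) @ H_pinv.conj().T
    return rhs - lhs

gap_fat = reverse_order_gap(random_complex(2, 4), random_pd(2))    # full row rank
gap_tall = reverse_order_gap(random_complex(4, 2), random_pd(4))   # full column rank

# Smallest eigenvalue of the (Hermitian part of the) gap in the tall case.
min_eig_tall = np.linalg.eigvalsh((gap_tall + gap_tall.conj().T) / 2).min()
```

For the fat matrix the gap vanishes up to round-off, in agreement with (23); for the tall matrix the gap is a nonzero positive semidefinite matrix, in agreement with (24).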

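Finally, we need the following.

Lemma 4: Let $\mathbf{X} = \mathbf{X}^H \in \mathbb{C}^{n\times n}$ and $\mathbf{A} \in \mathbb{C}^{m\times n}$. Then,

$$\left\|\mathbf{A}\mathbf{X}\mathbf{A}^H\right\|_F \le \rho\left(\mathbf{A}^H\mathbf{A}\right)\|\mathbf{X}\|_F. \tag{25}$$

□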

We can now focus on the contraction properties of the
multiuser MIMO waterfilling operator. We will consider only
the case where all the direct channel matrices are either
full row-rank or full column-rank, without loss of generality
(w.l.o.g.). The rank deficient case in fact can be cast into
the full column-rank case by a proper transformation of
the original rank deficient channel matrices into
lower-dimensional full column-rank matrices, as shown in Appendix
C (see also Section IV).

2) Case of full row-rank (fat/square) channel matrices:
We start by assuming that the channel matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$ are full
row-rank. The contraction property of the waterfilling mapping
is given in the following theorem, which generalizes [16, Th.
5].



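Theorem 5 (Contraction property of WF mapping):
Suppose that $\mathrm{rank}(\mathbf{H}_{qq}) = n_{R_q}$, $\forall q \in \Omega$ (i.e.,
full rank fat/square matrices). Then, for any given
$\mathbf{w} \triangleq [w_1, \ldots, w_Q]^T > \mathbf{0}$, the mapping WF defined in
(15) is Lipschitz continuous on $\mathcal{Q}$:

$$\left\|\mathrm{WF}(\mathbf{Q}^{(1)}) - \mathrm{WF}(\mathbf{Q}^{(2)})\right\|_{F,\mathrm{block}}^{\mathbf{w}} \le \|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} \left\|\mathbf{Q}^{(1)} - \mathbf{Q}^{(2)}\right\|_{F,\mathrm{block}}^{\mathbf{w}}, \tag{26}$$

$\forall\, \mathbf{Q}^{(1)}, \mathbf{Q}^{(2)} \in \mathcal{Q}$, where $\|\cdot\|_{F,\mathrm{block}}^{\mathbf{w}}$, $\|\cdot\|_{\infty,\mathrm{mat}}^{\mathbf{w}}$ and $\mathbf{S}$ are
defined in (16), (18) and (20), respectively. Furthermore, if
the following condition is satisfied

$$\|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} < 1, \qquad \text{for some } \mathbf{w} > \mathbf{0}, \tag{27}$$

then the mapping WF is a block-contraction with modulus
$\beta = \|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}}$.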
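
To make condition (27) concrete, the sketch below builds the matrix $\mathbf{S}$ of (20) from a set of channel matrices and evaluates the weighted norm (18) for a given weight vector. The function names and the nested-list layout `H[r][q]` (channel from source r to destination q) are assumptions of this illustration; the identity $\rho(\mathbf{A}^H\mathbf{A}) = \sigma_{\max}^2(\mathbf{A})$ is used to compute each entry.

```python
import numpy as np

def build_S(H):
    """Matrix S of eq. (20): [S]_qr = rho(H_rq^H H_qq^{#H} H_qq^# H_rq) for r != q."""
    Q = len(H)
    S = np.zeros((Q, Q))
    for q in range(Q):
        H_qq_pinv = np.linalg.pinv(H[q][q])
        for r in range(Q):
            if r != q:
                A = H_qq_pinv @ H[r][q]
                # rho(A^H A) equals the squared largest singular value of A.
                S[q, r] = np.linalg.norm(A, 2) ** 2
    return S

def weighted_inf_norm(S, w):
    """Induced weighted norm of eq. (18): max_q (1/w_q) * sum_r |S_qr| w_r."""
    w = np.asarray(w, dtype=float)
    return float(np.max((np.abs(S) @ w) / w))

# Two links with square direct channels 2*I and weak cross-channels.
H_example = [[2.0 * np.eye(2), 0.5 * np.eye(2)],
             [1.0 * np.eye(2), 2.0 * np.eye(2)]]
S_example = build_S(H_example)
beta_example = weighted_inf_norm(S_example, np.ones(2))
```

In this toy setting `beta_example` is below one, so (27) holds with uniform weights; the theorem itself applies only when every direct channel is full row-rank.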

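Proof: Given $\mathbf{Q}^{(1)} = (\mathbf{Q}_1^{(1)}, \ldots, \mathbf{Q}_Q^{(1)}) \in \mathcal{Q}$ and $\mathbf{Q}^{(2)} =
(\mathbf{Q}_1^{(2)}, \ldots, \mathbf{Q}_Q^{(2)}) \in \mathcal{Q}$, let us define, for each $q \in \Omega$,

$$e_{\mathrm{WF}_q} \triangleq \left\|\mathrm{WF}_q(\mathbf{Q}_{-q}^{(1)}) - \mathrm{WF}_q(\mathbf{Q}_{-q}^{(2)})\right\|_F, \tag{28}$$

$$e_q \triangleq \left\|\mathbf{Q}_q^{(1)} - \mathbf{Q}_q^{(2)}\right\|_F, \tag{29}$$

where, according to Lemma 1, each component $\mathrm{WF}_q(\mathbf{Q}_{-q})$
of WF can be rewritten as in (12), with $\mathbf{R}_{-q}(\mathbf{Q}_{-q})$ defined
in (4). Then, we have:

$$\begin{aligned}
e_{\mathrm{WF}_q} &= \Big\| \Big[-\Big(\big(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q}^{(1)})\mathbf{H}_{qq}\big)^{\sharp} + c_q\mathbf{P}_{\mathcal{N}(\mathbf{H}_{qq})}\Big)\Big]_{\mathcal{Q}_q} - \Big[-\Big(\big(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q}^{(2)})\mathbf{H}_{qq}\big)^{\sharp} + c_q\mathbf{P}_{\mathcal{N}(\mathbf{H}_{qq})}\Big)\Big]_{\mathcal{Q}_q} \Big\|_F && (30) \\
&\le \Big\| \big(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q}^{(1)})\mathbf{H}_{qq}\big)^{\sharp} - \big(\mathbf{H}_{qq}^H\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q}^{(2)})\mathbf{H}_{qq}\big)^{\sharp} \Big\|_F && (31) \\
&= \Big\| \mathbf{H}_{qq}^{\sharp}\Big(\sum_{r\neq q}\mathbf{H}_{rq}\big(\mathbf{Q}_r^{(1)} - \mathbf{Q}_r^{(2)}\big)\mathbf{H}_{rq}^H\Big)\mathbf{H}_{qq}^{\sharp H} \Big\|_F && (32) \\
&\le \sum_{r\neq q} \rho\big(\mathbf{H}_{rq}^H\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{H}_{rq}\big) \big\|\mathbf{Q}_r^{(1)} - \mathbf{Q}_r^{(2)}\big\|_F && (33) \\
&= \sum_{r\neq q} [\mathbf{S}]_{qr}\, e_r, && (34)
\end{aligned}$$

$\forall\, \mathbf{Q}^{(1)}, \mathbf{Q}^{(2)} \in \mathcal{Q}$ and $\forall q \in \Omega$, where: (30) follows from
(12) (Lemma 1) with $c_q > \max\big(c_q(\mathbf{Q}_{-q}^{(1)}), c_q(\mathbf{Q}_{-q}^{(2)})\big)$; (31)
follows from the non-expansive property of the projector in
the Frobenius norm as given in (13) (Lemma 2); (32) follows
from (23) (Lemma 3) and the assumption $\mathrm{rank}(\mathbf{H}_{qq}) = n_{R_q}$,
$\forall q \in \Omega$; (33) follows from the triangle inequality [26] and
(25) (Lemma 4); and in (34) we used the definition of $\mathbf{S}$ as
given in (20).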

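Introducing the vectors

$$\mathbf{e}_{\mathrm{WF}} \triangleq [e_{\mathrm{WF}_1}, \ldots, e_{\mathrm{WF}_Q}]^T, \quad \text{and} \quad \mathbf{e} \triangleq [e_1, \ldots, e_Q]^T, \tag{35}$$

with $e_{\mathrm{WF}_q}$ and $e_q$ defined in (28) and (29), respectively, the
set of inequalities in (34) can be rewritten in vector form as

$$\mathbf{0} \le \mathbf{e}_{\mathrm{WF}} \le \mathbf{S}\mathbf{e}, \qquad \forall\, \mathbf{Q}^{(1)}, \mathbf{Q}^{(2)} \in \mathcal{Q}. \tag{36}$$

Using the weighted maximum norm $\|\cdot\|_{\infty,\mathrm{vec}}^{\mathbf{w}}$ defined in (17)
in combination with (36), we have, for any given $\mathbf{w} > \mathbf{0}$,

$$\|\mathbf{e}_{\mathrm{WF}}\|_{\infty,\mathrm{vec}}^{\mathbf{w}} \le \|\mathbf{S}\mathbf{e}\|_{\infty,\mathrm{vec}}^{\mathbf{w}} \le \|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} \|\mathbf{e}\|_{\infty,\mathrm{vec}}^{\mathbf{w}}, \tag{37}$$

$\forall\, \mathbf{Q}^{(1)}, \mathbf{Q}^{(2)} \in \mathcal{Q}$, where $\|\cdot\|_{\infty,\mathrm{mat}}^{\mathbf{w}}$ is the matrix norm induced
by the vector norm $\|\cdot\|_{\infty,\mathrm{vec}}^{\mathbf{w}}$ in (17) and defined in (18) [26].
Finally, using (37) and (16), we obtain

$$\left\|\mathrm{WF}(\mathbf{Q}^{(1)}) - \mathrm{WF}(\mathbf{Q}^{(2)})\right\|_{F,\mathrm{block}}^{\mathbf{w}} = \|\mathbf{e}_{\mathrm{WF}}\|_{\infty,\mathrm{vec}}^{\mathbf{w}} \le \|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} \left\|\mathbf{Q}^{(1)} - \mathbf{Q}^{(2)}\right\|_{F,\mathrm{block}}^{\mathbf{w}}, \tag{38}$$

$\forall\, \mathbf{Q}^{(1)}, \mathbf{Q}^{(2)} \in \mathcal{Q}$ and $\forall\, \mathbf{w} > \mathbf{0}$, which leads to a
block-contraction for the mapping WF if $\|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} < 1$, implying
condition (27).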



Negative result #3: The waterfilling mapping WF satisfies the
Lipschitz property in (26) if the channel matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$
are full row-rank. Surprisingly, if the channels are not full
row-rank matrices, the property in (26) does not hold for
every given set of matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$, implying that the WF
mapping is not a contraction under (27).

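Numerical counter-example: Consider a game with two players.
The Lipschitz property of the WF as given in Theorem 5
can be written, e.g., for player 1 as (a similar expression has
to be satisfied for player 2)

$$\left\|\mathrm{WF}_1(\mathbf{Q}_2^{(1)}) - \mathrm{WF}_1(\mathbf{Q}_2^{(2)})\right\|_F \le \rho\left(\mathbf{H}_{21}^H\mathbf{H}_{11}^{\sharp H}\mathbf{H}_{11}^{\sharp}\mathbf{H}_{21}\right)\left\|\mathbf{Q}_2^{(1)} - \mathbf{Q}_2^{(2)}\right\|_F, \tag{39}$$

$\forall\, \mathbf{Q}^{(1)}, \mathbf{Q}^{(2)} \in \mathcal{Q}$. The following set of channels and users'
covariance matrices does not satisfy the inequality in (39) [and
thus (26)]: $P_1 = P_2 = 10$, $\mathbf{R}_{n_1} = \mathbf{I}$, $\mathbf{R}_{n_2} = \mathbf{I}$,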



Hi 



Ho 



and 



1 — 



H 



22 — 



0.5458 4 
-2.1758- 
-1.0132 



0.0819i 
-0.781H 
- 1.1376i 



-0.5449 
-1.9082 
-1.8198 



1.870H 
0.9013i 
0.1200i 
(40) 



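$$\mathbf{H}_{12} = \begin{bmatrix} 0.5865 + 0.4392i & 1.4387 - 2.2133i \\ 1.5959 - 0.2853i & -1.5410 - 0.2285i \\ -0.1035 + 2.0967i & -0.3196 + 1.0228i \end{bmatrix}, \tag{41}$$

$$\mathbf{Q}_2^{(1)} = \begin{bmatrix} 9.9175 & -0.8946 \\ -0.8946 & 0.0825 \end{bmatrix}, \tag{42}$$

$$\mathbf{Q}_2^{(2)} = \begin{bmatrix} 8.5842 & -1.2150 \\ -1.2150 & 1.4158 \end{bmatrix}. \tag{43}$$

The above setup leads to the following:
$\|\mathrm{WF}_1(\mathbf{Q}_2^{(1)}) - \mathrm{WF}_1(\mathbf{Q}_2^{(2)})\|_F = 5.2925$,
$\rho(\mathbf{H}_{21}^H\mathbf{H}_{11}^{\sharp H}\mathbf{H}_{11}^{\sharp}\mathbf{H}_{21}) = 2.5012$, and
$\|\mathbf{Q}_2^{(1)} - \mathbf{Q}_2^{(2)}\|_F = 1.9392$. Since
$\rho(\mathbf{H}_{21}^H\mathbf{H}_{11}^{\sharp H}\mathbf{H}_{11}^{\sharp}\mathbf{H}_{21})\,\|\mathbf{Q}_2^{(1)} - \mathbf{Q}_2^{(2)}\|_F =
4.8502$, condition (39) is not satisfied. Since this happens for
both users, there exists no vector $\mathbf{w} > \mathbf{0}$ such that condition
(26) can be satisfied [observe that for the setup above we
have $\|\mathbf{S}\|_{\infty,\mathrm{mat}}^{\mathbf{w}} = \rho(\mathbf{H}_{21}^H\mathbf{H}_{11}^{\sharp H}\mathbf{H}_{11}^{\sharp}\mathbf{H}_{21})$].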
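
The arithmetic behind this counter-example can be re-checked from the reported numbers alone. The short sketch below recomputes the Frobenius distance between the two covariance matrices in (42) and (43) and compares the two sides of (39); the spectral radius 2.5012 and the left-hand side 5.2925 are taken as reported, since the channel matrices are not used here. Variable names are illustrative.

```python
import numpy as np

# Covariance matrices reported in (42) and (43).
Q_a = np.array([[9.9175, -0.8946], [-0.8946, 0.0825]])
Q_b = np.array([[8.5842, -1.2150], [-1.2150, 1.4158]])

rho_reported = 2.5012   # reported spectral radius on the right-hand side of (39)
lhs_reported = 5.2925   # reported distance between the two waterfilling outputs

distance = np.linalg.norm(Q_a - Q_b, "fro")   # Frobenius distance, about 1.9392
rhs = rho_reported * distance                 # right-hand side of (39)
violates_39 = lhs_reported > rhs              # True: the Lipschitz bound fails
```

Both matrices have trace 10, matching the power budget, and the recomputed right-hand side agrees with the reported 4.8502 up to rounding.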

The above example shows that (26) is not true for (strictly)
tall matrices and, therefore, condition (27) is not enough
to guarantee the WF mapping to be a block-contraction,
implying that in such a case stronger conditions are needed.
This is due to the fact that the reverse order law for generalized
inverses of product of matrices (Lemma 3) does not hold if
the outer matrices involved in the product are (strictly) tall
[recall that the inequality in (24) always holds]. It turns out
that, in such a case, a different approach is required to derive
conditions guaranteeing the WF mapping to be a contraction.
We address this issue in the next section.

3) Case of full column-rank channel matrices: The main
difficulty in deriving contraction properties of the MIMO
multiuser waterfilling mapping in the case of (strictly) tall
channel matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$ is that one cannot use the reverse
order law of generalized inverses given in Lemma 3, as done
in the proof of Theorem 5 [see (31)-(32)]. To overcome this
issue, we develop a different approach based on the mean-value
theorem for complex matrix-valued functions, as detailed
next.

Mean-value theorem for complex matrix-valued functions: The 
mean value theorem for scalar real functions is one of the most 
important and basic theorems in functional analysis (cf. [34,
Th.5.10], EU Ch.5-Th.l0]). In this paper, we will use the 
simplest form of the theorem, as stated next. 
Mean value theorem for real scalar functions [34, Th. 5.10]:
Let $f : [a, b] \mapsto \mathbb{R}$ be a real continuous function on $[a, b]$,
differentiable on $(a, b)$ with (first) derivative denoted by $f'$.
Then,

$$\exists\, t \in (0, 1) \ \big|\ f(b) - f(a) = f'(t\,b + (1-t)\,a)\,(b - a). \tag{44}$$

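Negative result #4: Unfortunately, the generalization of (44)
to vector-valued real functions that one would expect does not
hold, meaning that for real vector-valued functions $\mathbf{f} : \mathcal{D} \subseteq
\mathbb{R}^m \mapsto \mathbb{R}^n$ in general

$$\nexists\, t \in (0, 1) \ \big|\ \mathbf{f}(\mathbf{y}) - \mathbf{f}(\mathbf{x}) = \mathbf{D}_{\mathbf{x}}\mathbf{f}(t\,\mathbf{y} + (1-t)\,\mathbf{x})\,(\mathbf{y} - \mathbf{x}), \tag{45}$$

for any $\mathbf{x}, \mathbf{y} \in \mathcal{D}$ and $\mathbf{x} \neq \mathbf{y}$, where $\mathbf{D}_{\mathbf{x}}\mathbf{f}$ denotes the Jacobian
matrix of $\mathbf{f}$ (cf. Appendix B). One of the simplest examples
to illustrate (45) is the following. Consider the real vector-valued
function $\mathbf{f}(x) = [x^{\alpha}, x^{\beta}]^T$, with $x \in \mathbb{R}$ and, e.g., $\alpha =
2$, $\beta = 3$. There exists no value of $t \in (0, 1)$ such that $\mathbf{f}(1) =
\mathbf{f}(0) + \mathbf{D}_t\mathbf{f}(t)$.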
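
This small counter-example is easy to verify numerically. With $\alpha = 2$ and $\beta = 3$, matching the first component requires $t = 1/2$ while matching the second requires $t = 1/\sqrt{3}$, so no single $t$ works. The sketch below scans a grid over $(0, 1)$; the grid resolution and the variable names are arbitrary choices made for this illustration.

```python
import numpy as np

def f(x, alpha=2, beta=3):
    """Vector-valued function f(x) = [x^alpha, x^beta]."""
    return np.array([x**alpha, x**beta])

def jacobian(t, alpha=2, beta=3):
    """Derivative of f with respect to its scalar argument, evaluated at t."""
    return np.array([alpha * t**(alpha - 1), beta * t**(beta - 1)])

increment = f(1.0) - f(0.0)                     # equals [1, 1]
t_grid = np.linspace(0.0, 1.0, 10001)[1:-1]     # interior points of (0, 1)

# Distance between D_t f(t) * (1 - 0) and the increment, for every t on the grid.
mismatch = np.array([np.linalg.norm(jacobian(t) - increment) for t in t_grid])
smallest_mismatch = mismatch.min()              # stays bounded away from zero

# A norm inequality, however, can still hold for some t.
jacobian_norms = np.array([np.linalg.norm(jacobian(t)) for t in t_grid])
norm_bound_holds = bool(np.any(np.linalg.norm(increment) <= jacobian_norms))
```

The equality version fails for every $t$ on the grid, while the relaxed statement obtained by taking norms on both sides is satisfied for some $t$.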

Many extensions and variations of the mean value theorem
exist in the literature, either for (real/complex) scalar or real
vector-valued functions (see, e.g., [35], [36]). Here, we provide
an extension of (44) in a form that is useful to our purpose,
valid for complex matrix-valued functions. Interestingly, our
result shows that (44) can be generalized to
(complex) vector/matrix-valued functions if one: i) takes the
norm on both sides; and ii) relaxes the equality to an
inequality.

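Lemma 6: Let $\mathbf{F}(\mathbf{X}) : \mathcal{D} \subseteq \mathbb{C}^{m\times n} \mapsto \mathbb{C}^{p\times q}$ be a complex
matrix-valued function defined on a convex set $\mathcal{D}$, assumed to
be continuous on $\mathcal{D}$ and differentiable on the interior of $\mathcal{D}$,
with Jacobian matrix $\mathbf{D}_{\mathbf{X}}\mathbf{F}(\mathbf{X})$. Then, for any given $\mathbf{X}, \mathbf{Y} \in \mathcal{D}$,
there exists some $t \in (0, 1)$ such that

$$\|\mathbf{F}(\mathbf{Y})-\mathbf{F}(\mathbf{X})\|_{F} \leq \|\mathbf{D}_{\mathbf{X}}\mathbf{F}(t\,\mathbf{Y} + (1-t)\,\mathbf{X})\,\mathrm{vec}(\mathbf{Y}-\mathbf{X})\|_{2} \qquad (46)$$

$$\leq \|\mathbf{D}_{\mathbf{X}}\mathbf{F}(t\,\mathbf{Y} + (1-t)\,\mathbf{X})\|_{2,\mathrm{mat}}\, \|\mathbf{Y}-\mathbf{X}\|_{F}, \qquad (47)$$

where $\|\mathbf{A}\|_{2,\mathrm{mat}} = \sqrt{\rho(\mathbf{A}^{H}\mathbf{A})}$ denotes the spectral norm of $\mathbf{A}$.

Proof: See Appendix B. ■

We can now provide the contraction theorem for the
WF mapping valid also for the case in which the channels
$\{\mathbf{H}_{qq}\}_{q\in\Omega}$ are full column-rank matrices.

Theorem 7 (Contraction property of the WF mapping):
Suppose that $\mathrm{rank}(\mathbf{H}_{qq}) = n_{T_q}$, $\forall q \in \Omega$ (i.e.,
full rank tall/square matrices). Then, for any given
$\mathbf{w} = [w_1, \ldots, w_Q]^T > \mathbf{0}$, the mapping $\mathbf{WF}$ defined in
(15) is Lipschitz continuous on $\mathscr{Q}$:

$$\|\mathbf{WF}(\mathbf{Q}^{(1)}) - \mathbf{WF}(\mathbf{Q}^{(2)})\|_{F,\mathrm{block}}^{\mathbf{w}} \leq \|\mathbf{S}(\mathcal{P}^{*})\|_{\infty,\mathrm{mat}}^{\mathbf{w}}\; \|\mathbf{Q}^{(1)} - \mathbf{Q}^{(2)}\|_{F,\mathrm{block}}^{\mathbf{w}}, \qquad (48)$$

with $\|\cdot\|_{F,\mathrm{block}}^{\mathbf{w}}$, $\|\cdot\|_{\infty,\mathrm{mat}}^{\mathbf{w}}$ and $\mathbf{S}(\mathcal{P}^{*})$ defined in (16), (18) and
(19), respectively, where $\mathcal{P}^{*} = (\mathbf{P}^{*}_{rq})_{r\neq q}$ and each $\mathbf{P}^{*}_{rq}$ is a
solution to the following optimization problem:

$$\begin{array}{ll} \underset{\mathbf{P}_{rq}}{\text{maximize}} & \rho\left(\mathbf{H}_{rq}^{H}\mathbf{P}_{rq}^{H}\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{P}_{rq}\mathbf{H}_{rq}\right) \\ \text{subject to} & \mathbf{P}_{rq} = \mathbf{P}_{rq}^{2}, \\ & \mathcal{R}(\mathbf{P}_{rq}) = \mathcal{R}(\mathbf{H}_{qq}), \\ & \mathcal{N}(\mathbf{P}_{rq}) = \mathcal{N}\left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\right). \end{array} \qquad (49)$$

Furthermore, if the following condition is satisfied

$$\|\mathbf{S}(\mathcal{P}^{*})\|_{\infty,\mathrm{mat}}^{\mathbf{w}} < 1, \quad \text{for some } \mathbf{w} > \mathbf{0}, \qquad (50)$$

then the mapping $\mathbf{WF}$ is a block-contraction with modulus
$\beta = \|\mathbf{S}(\mathcal{P}^{*})\|_{\infty,\mathrm{mat}}^{\mathbf{w}}$. □

Observe that stronger sufficient conditions for the WF
mapping to be a contraction than (50) can be obtained by
solving any relaxed (simpler) version of the optimization
problem in (49). For example, to simplify the optimization,
one can remove the last constraint in (49). However, solving (a
relaxation of) (49) may still not be easy. To overcome this issue
and give additional insight into the physical interpretation of
the conditions we obtained, we provide the following corollary,
which contains sufficient conditions for (50) that are easier to
check in practice, since they do not depend on the set $\mathcal{P}^{*}$.

Corollary 8: Given $\mathbf{S}(\mathcal{P}^{*})$, defined in (19), and $\mathbf{S}^{\mathrm{up}}$,
with $\mathcal{P}^{*}$ given in Theorem 7, we have:

$$\mathbf{S}(\mathcal{P}^{*}) \leq \mathbf{S}^{\mathrm{up}}, \qquad (51)$$

implying

$$\|\mathbf{S}(\mathcal{P}^{*})\|_{\infty,\mathrm{mat}}^{\mathbf{w}} \leq \|\mathbf{S}^{\mathrm{up}}\|_{\infty,\mathrm{mat}}^{\mathbf{w}}, \quad \text{for any } \mathbf{w} > \mathbf{0}. \qquad (52)$$

□

Footnote: Basic results on complex differential calculus for matrix-valued functions,
along with some intermediate results useful to prove Lemma 6, are given in
Appendix B.

Footnote: Since the complex matrix-valued functions we are interested in depend
only on $\mathbf{X}$ (and not on $\mathbf{X}^{*}$), we consider the mean value theorem only for this
case. The generalization to complex matrix-valued functions depending on
both $\mathbf{X}$ and $\mathbf{X}^{*}$ follows similar steps. Furthermore, since our functions are
continuous and differentiable on a convex set, we made these assumptions in
the theorem. However, the same statement can be established under weaker
assumptions.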
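The two inequalities of Lemma 6 can be illustrated on a simple complex matrix-valued function. The sketch below is only an illustration under stated assumptions: it uses the hypothetical test function $\mathbf{F}(\mathbf{X}) = \mathbf{X}^2$ (chosen because it depends on $\mathbf{X}$ only and its Jacobian is known in closed form), a column-major `vec`, and random test points. None of these names come from the paper. For this quadratic function the first inequality holds with equality at $t = 1/2$.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 3
X = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
Y = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

def F(Z):
    # Hypothetical test function F(Z) = Z^2 (depends on Z only, not on conj(Z))
    return Z @ Z

def vec(Z):
    # Column-major vectorization, consistent with vec(A B C) = (C^T kron A) vec(B)
    return Z.reshape(-1, order="F")

def jacobian(Z):
    # d vec(Z^2) = (I kron Z + Z^T kron I) d vec(Z)
    I = np.eye(Z.shape[0])
    return np.kron(I, Z) + np.kron(Z.T, I)

t = 0.5                                    # midpoint of the segment between X and Y
J = jacobian(t * Y + (1 - t) * X)

lhs = np.linalg.norm(F(Y) - F(X), "fro")   # left-hand side of the first inequality
mid = np.linalg.norm(J @ vec(Y - X), 2)    # Jacobian applied to vec(Y - X)
rhs = np.linalg.norm(J, 2) * np.linalg.norm(Y - X, "fro")  # spectral-norm bound

print(lhs, mid, rhs)
```

For general functions the point $t$ is not known in advance; the lemma only guarantees that some $t \in (0,1)$ exists.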

Proof of Theorem 7: The proof follows the same guidelines
as that of Theorem 5, with the key difference that, in the
case of (strictly) full column-rank channel matrices, we cannot
use the reverse order law (Lemma 3) as done to obtain (31)-(32)
in the proof of Theorem 5. We apply instead the mean-value
theorem in Lemma 6, as detailed next. For technical
reasons, we first introduce a proper complex matrix-valued
function $\mathbf{F}_q(\mathbf{Q}_{-q})$ related to the MIMO multiuser waterfilling
mapping $\mathbf{WF}_q(\mathbf{Q}_{-q})$ in $\mathscr{G}$ and, using Lemma 6, we study the
Lipschitz properties of the function on $\mathscr{Q}_{-q}$. Then, building
on this result, we show that the WF mapping satisfies (48).

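Given $q \in \Omega$, let us introduce the following complex matrix-valued
function $\mathbf{F}_q(\mathbf{Q}_{-q})$, defined on $\mathscr{Q}_{-q}$ as:

$$\mathbf{F}_q(\mathbf{Q}_{-q}) \triangleq \left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}\right)^{-1}, \qquad (53)$$

with $\mathbf{R}_{-q}(\mathbf{Q}_{-q})$ given in (4). Observe that the function
$\mathbf{F}_q(\mathbf{Q}_{-q})$ is continuous on $\mathscr{Q}_{-q}$ (implied from the continuity
of $\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})$ at any $\mathbf{Q}_{-q} \succeq \mathbf{0}$) and differentiable at any
$\mathbf{Q}_{-q} \succeq \mathbf{0}$ (cf. Appendix B). The Jacobian matrix of $\mathbf{F}_q(\mathbf{Q}_{-q})$
with respect to $\mathbf{Q}_{-q}$ is (see Appendix B):

$$\mathbf{D}_{\mathbf{Q}_{-q}}\mathbf{F}_q(\mathbf{Q}_{-q}) = \left[\mathbf{G}_{1q}^{*}(\mathbf{Q}_{-q})\otimes\mathbf{G}_{1q}(\mathbf{Q}_{-q}), \ldots, \mathbf{G}_{q-1\,q}^{*}(\mathbf{Q}_{-q})\otimes\mathbf{G}_{q-1\,q}(\mathbf{Q}_{-q}),\; \mathbf{G}_{q+1\,q}^{*}(\mathbf{Q}_{-q})\otimes\mathbf{G}_{q+1\,q}(\mathbf{Q}_{-q}), \ldots, \mathbf{G}_{Qq}^{*}(\mathbf{Q}_{-q})\otimes\mathbf{G}_{Qq}(\mathbf{Q}_{-q})\right], \qquad (54)$$

where

$$\mathbf{G}_{rq}(\mathbf{Q}_{-q}) \triangleq \left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}\right)^{-1}\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{rq}. \qquad (55)$$

Observe that $\mathbf{D}_{\mathbf{Q}_{-q}}\mathbf{F}_q(\mathbf{Q}_{-q})$ is well-defined and continuous at
any $\mathbf{Q}_{-q} \succeq \mathbf{0}$.

Therefore, function $\mathbf{F}_q(\mathbf{Q}_{-q})$ satisfies the assumption of
the mean-value theorem in Lemma 6, meaning that, for
any two different points $\mathbf{Q}_{-q}^{(1)}, \mathbf{Q}_{-q}^{(2)} \in \mathscr{Q}_{-q}$, with
$\mathbf{Q}_{-q}^{(i)} = [\mathbf{Q}_{1}^{(i)}, \ldots, \mathbf{Q}_{q-1}^{(i)}, \mathbf{Q}_{q+1}^{(i)}, \ldots, \mathbf{Q}_{Q}^{(i)}]$ for $i = 1, 2$, there exists
some $t \in (0, 1)$ such that, introducing

$$\boldsymbol{\Delta} \triangleq t\,\mathbf{Q}_{-q}^{(1)} + (1 - t)\,\mathbf{Q}_{-q}^{(2)}, \qquad (56)$$

we have:

Footnote: The continuity of $\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})$ can be proved using [27, Th. 10.7.1].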
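$$\|\mathbf{F}_q(\mathbf{Q}_{-q}^{(1)}) - \mathbf{F}_q(\mathbf{Q}_{-q}^{(2)})\|_{F} \leq \|\mathbf{D}_{\mathbf{Q}_{-q}}\mathbf{F}_q(\boldsymbol{\Delta})\,\mathrm{vec}(\mathbf{Q}_{-q}^{(1)} - \mathbf{Q}_{-q}^{(2)})\|_{2} \qquad (57)$$

$$\leq \sum_{r\neq q} \|\mathbf{G}_{rq}^{*}(\boldsymbol{\Delta})\otimes\mathbf{G}_{rq}(\boldsymbol{\Delta})\|_{2,\mathrm{mat}}\, \|\mathbf{Q}_{r}^{(1)} - \mathbf{Q}_{r}^{(2)}\|_{F} \qquad (58)$$

$$= \sum_{r\neq q} \rho\left(\mathbf{G}_{rq}^{H}(\boldsymbol{\Delta})\mathbf{G}_{rq}(\boldsymbol{\Delta})\right) \|\mathbf{Q}_{r}^{(1)} - \mathbf{Q}_{r}^{(2)}\|_{F}, \qquad (59)$$

where (57) follows from (46) (Lemma 6); (58) follows from
the structure of $\mathbf{D}_{\mathbf{Q}_{-q}}\mathbf{F}_q$ [see (54)] and the triangle inequality;
and (59) comes from the following chain of equalities:

$$\|\mathbf{G}_{rq}^{*}\otimes\mathbf{G}_{rq}\|_{2,\mathrm{mat}}^{2} = \rho\left[(\mathbf{G}_{rq}^{T}\otimes\mathbf{G}_{rq}^{H})(\mathbf{G}_{rq}^{*}\otimes\mathbf{G}_{rq})\right] = \rho\left[\mathbf{G}_{rq}^{T}\mathbf{G}_{rq}^{*}\otimes\mathbf{G}_{rq}^{H}\mathbf{G}_{rq}\right] = \left(\rho\left[\mathbf{G}_{rq}^{H}\mathbf{G}_{rq}\right]\right)^{2}, \qquad (60)$$

where the last equality in (60) follows from [29, Th. 21.11.4]
[implying $\rho(\mathbf{G}_{rq}^{T}\mathbf{G}_{rq}^{*}\otimes\mathbf{G}_{rq}^{H}\mathbf{G}_{rq}) = \rho(\mathbf{G}_{rq}^{T}\mathbf{G}_{rq}^{*})\,\rho(\mathbf{G}_{rq}^{H}\mathbf{G}_{rq})$]
and the fact that the eigenvalues of $\mathbf{G}_{rq}^{T}\mathbf{G}_{rq}^{*}$ coincide with
those of $\mathbf{G}_{rq}^{H}\mathbf{G}_{rq}$.

Observe that, differently from (33)-(34), the factor
$\alpha_{rq}(\boldsymbol{\Delta}) \triangleq \rho\left[\mathbf{G}_{rq}^{H}(\boldsymbol{\Delta})\mathbf{G}_{rq}(\boldsymbol{\Delta})\right]$ in (59) depends, in general,
on both $t \in (0, 1)$ and the covariance matrices $\mathbf{Q}_{-q}^{(1)}$ and $\mathbf{Q}_{-q}^{(2)}$
through $\boldsymbol{\Delta}$ [see (56)]:

$$\alpha_{rq}(\boldsymbol{\Delta}) = \rho\left[\mathbf{H}_{rq}^{H}\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\mathbf{H}_{qq}\left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\mathbf{H}_{qq}\right)^{-1} \left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\mathbf{H}_{qq}\right)^{-1}\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\mathbf{H}_{rq}\right], \qquad (61)$$

where in (61) we used (55). Interestingly, in the case of
square (nonsingular) channel matrices $\mathbf{H}_{qq}$, (61) reduces to
$\alpha_{rq}(\boldsymbol{\Delta}) = \rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{qq}^{-H}\mathbf{H}_{qq}^{-1}\mathbf{H}_{rq}\right) = [\mathbf{S}]_{qr}$, where $\mathbf{S}$ is defined
in (20), which leads to the same contraction factor for the WF
mapping as in Theorem 5.

We thus focus on the case of (strictly) full column-rank
matrices $\mathbf{H}_{qq}$ and look for an upper bound of $\alpha_{rq}(\boldsymbol{\Delta})$,
independent of $\boldsymbol{\Delta}$. To this end, we introduce the following:
the orthogonal projection onto $\mathcal{R}\left(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq}\right)$, given by

$$\mathbf{P}_{\mathcal{R}(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq})} = \mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq}\left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\mathbf{H}_{qq}\right)^{-1}\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta}), \qquad (62)$$

and the idempotent matrix $\mathbf{P}_{rq}(\boldsymbol{\Delta})$ [i.e., $\mathbf{P}_{rq}(\boldsymbol{\Delta}) = \mathbf{P}_{rq}^{2}(\boldsymbol{\Delta})$],
defined as

$$\mathbf{P}_{rq}(\boldsymbol{\Delta}) \triangleq \mathbf{R}_{-q}^{1/2}(\boldsymbol{\Delta})\,\mathbf{P}_{\mathcal{R}(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq})}\,\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta}). \qquad (63)$$

Observe that $\mathbf{P}_{rq}(\boldsymbol{\Delta})$ is still a projection (albeit non-orthogonal):
the projection onto the subspace $\mathcal{U}$ of $\mathbb{C}^{n_{R_q}}$ along the
subspace $\mathcal{V}$, where $\mathcal{U}$ and $\mathcal{V}$ are given by (observe that $\mathbf{H}_{qq}$
has an empty null space):

$$\mathcal{U} = \mathcal{R}(\mathbf{P}_{rq}(\boldsymbol{\Delta})) = \mathcal{R}(\mathbf{H}_{qq}), \qquad \mathcal{V} = \mathcal{N}(\mathbf{P}_{rq}(\boldsymbol{\Delta})) = \mathcal{N}\left(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\right), \qquad (64)$$

and $\mathcal{U} \oplus \mathcal{V} = \mathbb{C}^{n_{R_q}}$. Using the above definitions, we can rewrite
(61) as:

$$\alpha_{rq}(\boldsymbol{\Delta}) = \rho\left(\mathbf{H}_{rq}^{H}\mathbf{P}_{rq}^{H}(\boldsymbol{\Delta})\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{P}_{rq}(\boldsymbol{\Delta})\mathbf{H}_{rq}\right) \qquad (65)$$

$$\leq \rho\left(\mathbf{H}_{rq}^{H}\mathbf{P}_{rq}^{*H}\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{P}_{rq}^{*}\mathbf{H}_{rq}\right) \triangleq [\mathbf{S}(\mathcal{P}^{*})]_{qr}, \qquad (66)$$

where in (65) we used (62) and (63); and (66) follows from
the definition of $\mathcal{P}^{*}$ as given in (49) and the characterization
of $\mathbf{P}_{rq}(\boldsymbol{\Delta})$ as given in (64).

The Lipschitz property of the WF mapping as given in
(48) comes from (59) and (66), using the same steps as in the
proof of Theorem 5. We omit the details because of space
limitations. ■

Proof of Corollary 8: We now prove the corollary, providing
an upper bound of each $[\mathbf{S}(\mathcal{P}^{*})]_{qr}$ that is independent of $\mathbf{P}^{*}_{rq}$.
For any fixed $q$ and $r \neq q$, given $\boldsymbol{\Delta}$ in (56), it follows
from (65) that

$$\alpha_{rq}(\boldsymbol{\Delta}) \leq \rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{rq}\right)\rho\left(\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\right) \times \rho\Big(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\,\mathbf{P}_{\mathcal{R}(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq})}\,\mathbf{R}_{-q}(\boldsymbol{\Delta}) \qquad (67)$$

$$\qquad\qquad \times\, \mathbf{P}_{\mathcal{R}(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq})}\,\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\Big) \qquad (68)$$

$$\leq \rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{rq}\right)\rho\left(\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\right)\rho\left(\mathbf{R}_{-q}(\boldsymbol{\Delta})\right)\rho\left(\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\right) \qquad (69)$$

$$\leq \rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{rq}\right)\rho\left(\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\right)\mathrm{innr}_q, \qquad (70)$$

where (68)-(69) follow from

$$\rho\left(\mathbf{A}^{H}\mathbf{B}\mathbf{A}\right) \leq \rho\left(\mathbf{A}^{H}\mathbf{C}\mathbf{A}\right), \quad \text{for all } \mathbf{0} \preceq \mathbf{B} \preceq \mathbf{C}, \qquad (71)$$

and the fact that $\mathbf{P}_{\mathcal{R}(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq})}$ is an orthogonal projection,
implying $\mathbf{P}_{\mathcal{R}(\mathbf{R}_{-q}^{-1/2}(\boldsymbol{\Delta})\mathbf{H}_{qq})} \preceq \mathbf{I}$; and in (70) we used
$\rho\left[\mathbf{R}_{-q}(\boldsymbol{\Delta})\right]\rho\left[\mathbf{R}_{-q}^{-1}(\boldsymbol{\Delta})\right] \leq \mathrm{innr}_q$, with $\mathrm{innr}_q$ as defined earlier,
where the upper bound comes from the following chain of
inequalities:

$$\mathbf{R}_{n_q} \preceq \mathbf{R}_{-q}(\boldsymbol{\Delta}) = \mathbf{R}_{n_q} + \sum_{r\neq q}\mathbf{H}_{rq}\left(t\,\mathbf{Q}_{r}^{(1)} + (1 - t)\,\mathbf{Q}_{r}^{(2)}\right)\mathbf{H}_{rq}^{H} \preceq \mathbf{R}_{n_q} + \sum_{r\neq q}\mathrm{Tr}\left(t\,\mathbf{Q}_{r}^{(1)} + (1 - t)\,\mathbf{Q}_{r}^{(2)}\right)\mathbf{H}_{rq}\mathbf{H}_{rq}^{H}, \qquad (72)$$

where we used the fact that $t\,\mathbf{Q}_{r}^{(1)} + (1 - t)\,\mathbf{Q}_{r}^{(2)} \succeq \mathbf{0}$, for all
$t \in (0, 1)$ and $r \in \Omega$, and $\mathrm{Tr}(\mathbf{Q}_{r}^{(1)}) = \mathrm{Tr}(\mathbf{Q}_{r}^{(2)}) = P_r$ for all
$r \in \Omega$.

The inequality in (51) follows directly from (66) and (70)
[observe that the upper bound of $\alpha_{rq}(\boldsymbol{\Delta})$ in (70) is strict and
does not depend on $\boldsymbol{\Delta}$], which completes the proof. ■

Comparing Theorems 5 and 7 (see also Corollary 8), one
infers that conditions for the multiuser MIMO waterfilling
mapping to be a block-contraction in the case of full column-rank
(rectangular) channel matrices are stronger than those
required when the channels are full row-rank (rectangular)
matrices. Interestingly, both conditions coincide if the (direct)
channel matrices are square nonsingular [in fact, $\mathrm{rank}(\mathbf{P}_{rq}) = n_{R_q}$,
implying that the feasible set of (49) contains only the
point $\mathbf{P}_{rq} = \mathbf{I}$]. More specifically, for any given set of channel
matrices (either tall or fat), the relationship existing among
conditions in Theorems 5 and 7 is the following:

$$\mathbf{S} \leq \mathbf{S}(\mathcal{P}^{*}) \leq \mathbf{S}^{\mathrm{up}}, \qquad (73)$$

where $\mathbf{S}$ and $\mathbf{S}(\mathcal{P}^{*})$ are defined in (20) and (19), respectively,
$\mathbf{S}^{\mathrm{up}}$ is the upper bound of Corollary 8, and $\mathcal{P}^{*}$ is given in Theorem 7. The lower bound in
(73) is indeed reached when the channels $\{\mathbf{H}_{qq}\}_{q\in\Omega}$ are square
nonsingular matrices. A physical interpretation of sufficient
conditions (27) and (50) is given in the next section where,
building on Theorems 5 and 7, we provide a unified set of
sufficient conditions guaranteeing the uniqueness of the NE
of game $\mathscr{G}$.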
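As a numerical sanity check of the quantities appearing in the proofs of Theorem 7 and Corollary 8, the following sketch builds the oblique projector of (63) in its simplified form $\mathbf{H}_{qq}(\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}\mathbf{H}_{qq})^{-1}\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}$ and verifies the identities (60) and (65) and the bound (69). The matrix sizes, the random channels, and the randomly generated positive definite matrix `R` standing in for the interference-plus-noise covariance are all hypothetical choices made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(1)
n_r, n_t = 4, 2                       # hypothetical sizes: a (strictly) tall direct channel

def crandn(*shape):
    # Circularly symmetric complex Gaussian entries
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

def rho(M):
    # Spectral radius
    return float(np.max(np.abs(np.linalg.eigvals(M))))

H_qq = crandn(n_r, n_t)               # direct channel (full column rank with probability one)
H_rq = crandn(n_r, n_t)               # cross channel
B = crandn(n_r, n_r)
R = np.eye(n_r) + B @ B.conj().T      # stand-in positive definite interference-plus-noise covariance
R_inv = np.linalg.inv(R)

F = np.linalg.inv(H_qq.conj().T @ R_inv @ H_qq)      # as in (53)
G = F @ H_qq.conj().T @ R_inv @ H_rq                 # as in (55)
P = H_qq @ F @ H_qq.conj().T @ R_inv                 # simplified form of (63)

alpha = rho(G.conj().T @ G)                          # factor appearing in (59)
kron_norm = np.linalg.norm(np.kron(G.conj(), G), 2)  # spectral norm in (58)

H_pinv = np.linalg.pinv(H_qq)
alpha_65 = rho(H_rq.conj().T @ P.conj().T @ H_pinv.conj().T @ H_pinv @ P @ H_rq)

bound_69 = (rho(H_rq.conj().T @ H_rq) * rho(H_pinv.conj().T @ H_pinv)
            * rho(R) * rho(R_inv))

print(alpha, kron_norm, alpha_65, bound_69)
```

The bound (69) is typically loose, which is consistent with the text's observation that the conditions for tall channels are more restrictive than those for full row-rank channels.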

IV. Existence and Uniqueness of the NE 

Using the results obtained in the previous section, we can
now study game $\mathscr{G}$ and derive conditions for the uniqueness
of the NE, as given next. Differently from current works in
the literature [6]-[16], the proposed conditions are valid for
any given set of arbitrary channel matrices, either tall/fat or
singular.

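Before stating the main theorem, we introduce the following
intermediate definitions. Given $\mathbf{S}(\mathcal{P})$ and $\mathbf{S}$, defined in
(19) and (20), respectively, and $\mathbf{S}^{\mathrm{up}}$, let $\overline{\mathbf{S}} \in \mathbb{R}_{+}^{Q\times Q}$ and
$\overline{\mathbf{S}}^{\mathrm{up}} \in \mathbb{R}_{+}^{Q\times Q}$ be nonnegative matrices, defined, for each $r, q \in \Omega$, as:

$$[\overline{\mathbf{S}}]_{qr} \triangleq \begin{cases} [\mathbf{S}]_{qr}, & \text{if } \mathrm{rank}(\mathbf{H}_{qq}) = n_{R_q}, \\ [\mathbf{S}(\mathcal{P}^{*})]_{qr}, & \text{otherwise}, \end{cases} \qquad (74)$$

and

$$[\overline{\mathbf{S}}^{\mathrm{up}}]_{qr} \triangleq \begin{cases} [\mathbf{S}]_{qr}, & \text{if } \mathrm{rank}(\mathbf{H}_{qq}) = n_{R_q}, \\ [\mathbf{S}^{\mathrm{up}}]_{qr}, & \text{otherwise}, \end{cases} \qquad (75)$$

where $\mathcal{P}^{*}$ is given in Theorem 7.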

A unified set of sufficient conditions guaranteeing the
uniqueness of the NE of $\mathscr{G}$ is given in the following theorem.

Theorem 9: Game $\mathscr{G}$ always admits a NE, for any set of
channel matrices and transmit power of the users. Furthermore,
the NE is unique if

$$\rho(\overline{\mathbf{S}}) < 1, \qquad (\mathrm{C1})$$

where $\overline{\mathbf{S}}$ is defined in (74).

Proof: See Appendix C. ■

Even though condition (C1) is valid for any set of channel
matrices, either tall/fat or singular, it may not be easy to
check when some (direct) channel matrix is (strictly) tall or
singular, because of the difficulty in the computation of the set
$\mathcal{P}^{*}$. To overcome this issue, we provide in the following a set
of sufficient conditions for (C1) that are easier to check and
still valid for arbitrary channel matrices.

Footnote: In the case of rank deficient channel matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$, one can obtain
a weaker condition than (C1), as given in Appendix C [see (127)].



Corollary 10: A sufficient condition for (C1) in Theorem 9
is

$$\rho(\overline{\mathbf{S}}^{\mathrm{up}}) < 1, \qquad (\mathrm{C2})$$

where $\overline{\mathbf{S}}^{\mathrm{up}}$ is defined in (75). □

Finally, to give additional insight into the physical interpre- 
tation of sufficient conditions for the uniqueness of the NE, 
we provide the following. 

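Corollary 11: If $\mathrm{rank}(\mathbf{H}_{qq}) = n_{R_q}$ for all $q \in \Omega$, then a
sufficient condition for (C1) in Theorem 9 is given by one of
the two following sets of conditions:

Low MUI received:

$$\frac{1}{w_q}\sum_{r\neq q}\rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{H}_{rq}\right)w_r < 1, \quad \forall q \in \Omega, \qquad (\mathrm{C3})$$

Low MUI generated:

$$\frac{1}{w_r}\sum_{q\neq r}\rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{H}_{rq}\right)w_q < 1, \quad \forall r \in \Omega, \qquad (\mathrm{C4})$$

where $\mathbf{w} = [w_1, \ldots, w_Q]^T$ is any positive vector.

If $\mathrm{rank}(\mathbf{H}_{qq}) \leq n_{T_q}$ for all $q \in \Omega$, then a sufficient
condition for (C1) is given by one of the two following sets
of conditions:

Low MUI received:

$$\frac{1}{w_q}\sum_{r\neq q}\mathrm{innr}_q\,\rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{rq}\right)\rho\left(\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\right)w_r < 1, \quad \forall q \in \Omega, \qquad (\mathrm{C5})$$

Low MUI generated:

$$\frac{1}{w_r}\sum_{q\neq r}\mathrm{innr}_q\,\rho\left(\mathbf{H}_{rq}^{H}\mathbf{H}_{rq}\right)\rho\left(\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\right)w_q < 1, \quad \forall r \in \Omega, \qquad (\mathrm{C6})$$

where the $\mathrm{innr}_q$'s are as defined earlier. □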

Remark 3 - On the uniqueness conditions. Conditions (C3)-(C4)
and (C5)-(C6) provide a physical interpretation of the
uniqueness of the NE: as expected, the uniqueness of the NE
is ensured if the interference among the links is sufficiently
small. The importance of (C3)-(C4) and (C5)-(C6) is that they
quantify how small the interference must be to guarantee that
the equilibrium is indeed unique. Specifically, conditions (C3)
and (C5) can be interpreted as a constraint on the maximum
amount of interference that each receiver can tolerate, whereas
(C4) and (C6) introduce an upper bound on the maximum level
of interference that each transmitter is allowed to generate.
Surprisingly, the above conditions differ if the channel matrices
$\{\mathbf{H}_{qq}\}_{q\in\Omega}$ are (strictly) tall or fat. More specifically, (C5)-(C6)
are stronger than (C3)-(C4), implying that in the case of
tall or rank deficient channel matrices a stronger constraint is
imposed on the multiuser interference to guarantee the uniqueness
of the NE. Furthermore, differently from (C3)-(C4),
conditions (C5)-(C6) depend also on the transmit powers of the
users and the noise power, through the $\mathrm{innr}_q$ coefficients.
This difference in the uniqueness conditions is due to the fact
that, in the case of (strictly) tall channel matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$,
the multiuser waterfilling requires a stronger condition to be
a contraction (cf. Sec. III-B).

Footnote: The case in which some channel matrices $\mathbf{H}_{qq}$ are (strictly) tall and some
others are fat, or there are rank deficient channel matrices, can be similarly
addressed: if $\mathrm{rank}(\mathbf{H}_{qq}) < n_{R_q}$ for some $q$ (implying that $\mathbf{H}_{qq}$ is a (strictly)
tall, possibly singular, or a fat singular matrix) we can still use (C3) [or (C4)],
where the $q$-th condition in (C3) [or the $q$-th term in the sum in (C4)] is
replaced by the $q$-th condition in (C5) [or the corresponding $q$-th term in the
sum in (C6)].

Finally, observe that all the above conditions coincide in the
case of square nonsingular matrices $\{\mathbf{H}_{qq}\}_{q\in\Omega}$ and, in such
a case, they contain, as special cases, most of the conditions
known in the literature [6]-[12] for the rate-maximization game
in SISO frequency-selective interference channels and OFDM
transmission. A detailed comparison between the general conditions
(C1) in the case of nonsingular square MIMO channels
and those obtained in the cited papers was recently given in 

m. 

To obtain weaker sufficient conditions guaranteeing the 
uniqueness of the Nash equilibrium (and the convergence 
of distributed algorithms) even in the case of tall channel 
matrices, which coincide with those obtained in the full row- 
rank case (the weakest ones), in Sec. V-B we propose a
variation of the original game whose performance in terms
of sum-rate is virtually the same as that of game $\mathscr{G}$.
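The uniqueness conditions of this section are straightforward to evaluate numerically once the channels are known. The sketch below is an illustration under stated assumptions: the indexing `H[r][q]` for the channel from transmitter `r` to receiver `q`, the zero diagonal of the matrix, and the toy channels are hypothetical choices, not taken from the paper. It computes the entries $\rho(\mathbf{H}_{rq}^{H}\mathbf{H}_{qq}^{\sharp H}\mathbf{H}_{qq}^{\sharp}\mathbf{H}_{rq})$ that appear in (C3)-(C4), then checks the spectral-radius condition and the two weighted-sum conditions.

```python
import numpy as np

def rho(M):
    # Spectral radius
    return float(np.max(np.abs(np.linalg.eigvals(M))))

def interference_matrix(H):
    # [S]_qr = rho(H_rq^H H_qq^{#H} H_qq^# H_rq) for r != q; diagonal assumed zero
    n_users = len(H)
    S = np.zeros((n_users, n_users))
    for q in range(n_users):
        H_pinv = np.linalg.pinv(H[q][q])       # pseudo-inverse of the direct channel
        for r in range(n_users):
            if r != q:
                M = H_pinv @ H[r][q]
                S[q, r] = rho(M.conj().T @ M)
    return S

def low_mui_received(S, w):
    # (C3): weighted row sums below one for every receiver q
    return bool(np.all((S @ w) / w < 1.0))

def low_mui_generated(S, w):
    # (C4): weighted column sums below one for every transmitter r
    return bool(np.all((S.T @ w) / w < 1.0))

def toy_channels(n_users, cross_gain):
    # Hypothetical 2x2 channels: identity direct links, scaled-identity cross links
    I = np.eye(2)
    return [[I if r == q else cross_gain * I for q in range(n_users)]
            for r in range(n_users)]

w = np.ones(3)
S_weak = interference_matrix(toy_channels(3, 0.2))
S_strong = interference_matrix(toy_channels(3, 1.0))

print(rho(S_weak), low_mui_received(S_weak, w), low_mui_generated(S_weak, w))
print(rho(S_strong), low_mui_received(S_strong, w), low_mui_generated(S_strong, w))
```

For a nonnegative matrix the spectral radius never exceeds the largest weighted row sum, which is why (C3) or (C4) implies the spectral-radius condition.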

V. Asynchronous Distributed Algorithms 

In this section we focus on distributed algorithms that 
converge to the NE of the game. We consider totally asyn- 
chronous distributed algorithms, meaning that in the updating 
procedure some users are allowed to change their strategy 
more frequently than the others, and they might even perform 
these updates using outdated information on the interference 
caused by the others. We propose two different classes of asyn- 
chronous algorithms and show that, whatever the asynchronous 
mechanism is, both algorithms converge to the globally asymp- 
totically stable NE of the games with virtually the same 
performance, under conditions guaranteeing the uniqueness of 
the NE. 

A. The MIMO Iterative Waterfilling Algorithm 

To provide a formal description of the proposed asynchronous
MIMO IWFA, we briefly recall some intermediate
definitions, as given in [16]. We assume, without loss of generality,
that the set of times at which one or more users update
their strategies is the discrete set $\mathcal{T} = \mathbb{N}_{+} = \{0, 1, 2, \ldots\}$.
Let $\mathbf{Q}_{q}^{(n)}$ denote the covariance matrix of the vector signal
transmitted by user $q$ at the $n$-th iteration, and let $\mathcal{T}_q \subseteq \mathcal{T}$
denote the set of times $n$ at which $\mathbf{Q}_{q}^{(n)}$ is updated (thus,
at time $n \notin \mathcal{T}_q$, $\mathbf{Q}_{q}^{(n)}$ is left unchanged). Let $\tau_{r}^{q}(n)$ denote
the most recent time at which the interference from user $r$ is
perceived by user $q$ at the $n$-th iteration (observe that $\tau_{r}^{q}(n)$
satisfies $0 \leq \tau_{r}^{q}(n) \leq n$). Hence, if user $q$ updates his own
covariance matrix at the $n$-th iteration, then he chooses his
optimal covariance matrix, according to (9), using the interference level
caused by

$$\mathbf{Q}_{-q}^{(\boldsymbol{\tau}^{q}(n))} \triangleq \left(\mathbf{Q}_{1}^{(\tau_{1}^{q}(n))}, \ldots, \mathbf{Q}_{q-1}^{(\tau_{q-1}^{q}(n))}, \mathbf{Q}_{q+1}^{(\tau_{q+1}^{q}(n))}, \ldots, \mathbf{Q}_{Q}^{(\tau_{Q}^{q}(n))}\right). \qquad (76)$$

Some standard conditions in asynchronous convergence theory
that are fulfilled in any practical implementation need to
be satisfied by the schedule $\{\tau_{r}^{q}(n)\}$ and $\{\mathcal{T}_q\}$; we refer to
[16], [12] for the details. Using the above notation, the asynchronous
MIMO IWFA is formally described in Algorithm 1.
Sufficient conditions that guarantee the global convergence of
the algorithm are given in Theorem 12.

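Algorithm 1: MIMO Asynchronous IWFA

Set $n = 0$ and $\mathbf{Q}_{q}^{(0)} \in \mathscr{Q}_q$;

for $n = 0 : N_{\mathrm{it}}$

$$\mathbf{Q}_{q}^{(n+1)} = \begin{cases} \mathbf{WF}_q\left(\mathbf{Q}_{-q}^{(\boldsymbol{\tau}^{q}(n))}\right), & \text{if } n \in \mathcal{T}_q, \\ \mathbf{Q}_{q}^{(n)}, & \text{otherwise}; \end{cases} \quad \forall q \in \Omega \qquad (77)$$

end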

Theorem 12: Assume that condition (C1) of Theorem 9 is
satisfied. Then, as $N_{\mathrm{it}} \to \infty$, the asynchronous MIMO IWFA,
described in Algorithm 1, converges to the unique NE of game
$\mathscr{G}$, for any set of feasible initial conditions and updating
schedule.

Proof: See Appendix D. ■

Remark 4 - Global convergence and robustness of the algorithm.
The global convergence of the nonlinear asynchronous
MIMO IWFA is guaranteed under condition (C1) that, differently
from [16], where only square nonsingular channel matrices
were considered, is valid for arbitrary channel matrices,
either tall/fat or singular.

Observe that Algorithm 1 contains as special cases a plethora
of algorithms, each one obtained by a possible choice of the
scheduling of the users in the updating procedure (i.e., the
parameters $\{\tau_{r}^{q}(n)\}$ and $\{\mathcal{T}_q\}$). All these algorithms can be
implemented in a distributed way, where each user, to maximize
his own rate, only needs to measure the covariance matrix
of the overall interference-plus-noise and waterfill over this
matrix. Two well-known special cases are the sequential and
the simultaneous MIMO IWFA, where the users update their
own strategies sequentially and simultaneously, respectively
[16]. The important result stated in Theorem 12 is that all
the algorithms resulting as special cases of the asynchronous
MIMO IWFA are guaranteed to reach the unique NE of the
game, under the same set of convergence conditions, since
conditions in (C1) do not depend on the particular choice of
$\{\mathcal{T}_q\}$ and $\{\tau_{r}^{q}(n)\}$.
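The two special cases mentioned above, the simultaneous and the sequential IWFA, can be sketched in a few lines. The code below is a minimal illustration, not the authors' implementation: it assumes the standard single-user MIMO waterfilling solution over the equivalent channel $\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}\mathbf{H}_{qq}$ with a trace (power) constraint met with equality, the indexing `H[r][q]` for the channel from transmitter `r` to receiver `q`, and hypothetical toy channels. General asynchronous schedules with delayed interference measurements are not modeled.

```python
import numpy as np

def waterfill(M, power, tol=1e-12):
    # Waterfilling over a Hermitian PSD equivalent channel M with trace budget `power`
    M = (M + M.conj().T) / 2                   # remove numerical asymmetry
    gains, U = np.linalg.eigh(M)
    active = gains > tol
    if not active.any():
        raise ValueError("equivalent channel has no usable eigenmode")
    levels = np.sort(1.0 / gains[active])      # inverse gains, best eigenmode first
    for k in range(len(levels), 0, -1):        # largest k whose water level exceeds level k
        mu = (power + levels[:k].sum()) / k
        if mu > levels[k - 1]:
            break
    p = np.zeros_like(gains)
    p[active] = np.maximum(mu - 1.0 / gains[active], 0.0)
    return (U * p) @ U.conj().T                # U diag(p) U^H

def best_response(q, Qs, H, R_noise, powers):
    # User q waterfills over H_qq^H R_{-q}^{-1} H_qq given the other users' covariances
    R = R_noise[q].astype(complex)
    for r in range(len(H)):
        if r != q:
            R = R + H[r][q] @ Qs[r] @ H[r][q].conj().T
    M = H[q][q].conj().T @ np.linalg.solve(R, H[q][q])
    return waterfill(M, powers[q])

def iwfa(H, R_noise, powers, n_iter=50, sequential=False):
    # Simultaneous (default) or sequential IWFA, started from uniform power allocation
    n_users = len(H)
    Qs = [powers[q] / H[q][q].shape[1] * np.eye(H[q][q].shape[1], dtype=complex)
          for q in range(n_users)]
    for _ in range(n_iter):
        seen = Qs if sequential else list(Qs)  # sequential users see fresh updates
        for q in range(n_users):
            Qs[q] = best_response(q, seen, H, R_noise, powers)
    return Qs

# Hypothetical two-user example with weak cross channels
C = 0.1 * np.array([[1.0, 0.5], [0.5, 1.0]])
H = [[np.eye(2), C], [C, np.eye(2)]]
R_noise = [np.eye(2), np.eye(2)]
powers = [1.0, 2.0]

Q_sim = iwfa(H, R_noise, powers)
Q_seq = iwfa(H, R_noise, powers, sequential=True)
print(np.round(Q_sim[0].real, 4))
```

With weak cross channels both schedules reach the same fixed point, in line with the statement that the convergence conditions do not depend on the update schedule.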

B. The MIMO Iterative Projection Algorithm 

So far we have seen that, in the case of (strictly) tall (or
rank deficient) channel matrices, the conditions needed for the
multiuser waterfilling mapping to be a contraction (and thus
guaranteeing the uniqueness of the NE of $\mathscr{G}$) are stronger
than those required in the case of full row-rank channels.
Furthermore, the former conditions depend also on the power
budgets of the users, whereas the latter depend only on the
channels. The difference in the two cases above is due to the
fact that the reverse order law of generalized inverses of the
products of matrices (Lemma 3), which plays a fundamental
role in the proof of the contraction property of the WF in the
case of full row-rank matrices (Theorem 5), does not hold in
the case of (strictly) tall (or singular) matrices [see (23)].

We would like to have a unified set of conditions guaran- 
teeing both the uniqueness of the NE and the convergence of 
asynchronous distributed algorithms, possibly independent of 
the users' transmit powers and valid for all channel structures. 
In this section we focus on this issue and propose a new game, 
based on a variation of the original waterfilling best-response 
mapping, having the desired properties: the uniqueness of the 
NE as well as the convergence of distributed algorithms to the 
NE are guaranteed under the same unified set of conditions 
that are valid for all channel matrices (fat/square/tall and 
singular/nonsingular) and coincide with those given for the
game $\mathscr{G}$ in the case of full row-rank channel matrices (the
weakest ones). This leads to a new game with, in general,
different Nash equilibria with respect to the original game 
but with almost the same performance in terms of sum-rate. 
Interestingly, the two games coincide if all the (direct) channel 
matrices are square nonsingular (when Lemma [3] holds). 

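To describe the new game, we focus w.l.o.g. only on the
case in which the direct channel matrices are full column-rank
matrices. The rank deficient case can be cast in the
full column-rank case (cf. Appendix C). The best-response
mapping of the new game is based on the following simple
idea, inspired by the proof of Theorem 5: to obtain the
desired conditions for the uniqueness of the NE and the
convergence of distributed algorithms, we would like to use
as best-response of each user $q$ the projection of the (minus)
RHS in (24) onto the feasible set $\mathscr{Q}_q$, even in the case in
which the reverse order law of the inverse in (24) does not
hold. This choice leads to the MIMO waterfilling solution
$\mathbf{WF}_q(\cdot)$ in $\mathscr{G}$ if Lemma 3 holds true; otherwise it provides a
different mapping that corresponds to a modified channel or,
equivalently, a modified game. Stated in mathematical terms, the
proposed best response of user $q$, given the strategies of the
others $\mathbf{Q}_{-q} \in \mathscr{Q}_{-q}$, is

$$\widetilde{\mathbf{WF}}_q(\mathbf{Q}_{-q}) \triangleq \left[-\left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)\right]_{\mathscr{Q}_q}. \qquad (78)$$

Interestingly, the proposed best-response mapping can
still be interpreted as a waterfilling solution, but over a proper
modified channel, as shown next. Invoking Lemma 1 [see also
(11)], (78) can be rewritten as

$$\widetilde{\mathbf{WF}}_q(\mathbf{Q}_{-q}) = \left[-\left(\left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)^{-1}\right)^{-1}\right]_{\mathscr{Q}_q} \qquad (79)$$

$$= \mathbf{WF}_q\left(\left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)^{-1}\right), \qquad (80)$$

where, with a slight abuse of notation, we used the same
notation as in (9) to denote the MIMO waterfilling solution
over the channel $\left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)^{-1}$. We call
this new game based on the mapping in (80) $\widetilde{\mathscr{G}} =
\{\Omega, \{\mathscr{Q}_q\}_{q\in\Omega}, \{\tilde{r}_q\}_{q\in\Omega}\}$, where $\Omega = \{1, \ldots, Q\}$, $\mathscr{Q}_q$ is
defined in (6), and $\tilde{r}_q(\mathbf{Q})$ is the payoff of user $q$, defined
as

$$\tilde{r}_q(\mathbf{Q}_q, \mathbf{Q}_{-q}) \triangleq \log\det\left(\mathbf{I} + \left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)^{-1}\mathbf{Q}_q\right). \qquad (81)$$

It follows then that, in game $\widetilde{\mathscr{G}}$, each player, given the
strategy of the others, still performs a MIMO waterfilling
solution, but over the modified equivalent channel
$\left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)^{-1}$, which is in general different from
the one used in the original game $\mathscr{G}$, since the reverse order
law of the inverse does not hold when the channel matrices
$\mathbf{H}_{qq}$ are (strictly) tall, i.e., $\left(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}(\mathbf{Q}_{-q})\mathbf{H}_{qq}^{\sharp H}\right)^{-1} \neq
\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}(\mathbf{Q}_{-q})\mathbf{H}_{qq}$ [see (24)]. The two best-responses
$\widetilde{\mathbf{WF}}_q(\mathbf{Q}_{-q})$ in $\widetilde{\mathscr{G}}$ and $\mathbf{WF}_q(\mathbf{Q}_{-q})$ in $\mathscr{G}$ coincide if the
channel matrices $\mathbf{H}_{qq}$ are square nonsingular.

The full characterization of game $\widetilde{\mathscr{G}}$ is given next.

Theorem 13: Game $\widetilde{\mathscr{G}}$, based on the mapping in (78), always
admits a NE, for any set of channel matrices and transmit
power of the users. Furthermore, suppose that the following
condition holds true

$$\rho(\mathbf{S}) < 1, \qquad (\mathrm{C7})$$

where $\mathbf{S}$ is defined in (20). Then, the NE of the game is unique
and, as $N_{\mathrm{it}} \to \infty$, the asynchronous MIMO IWFA described
in Algorithm 1 and based on the mapping in (78) converges
to the unique NE of game $\widetilde{\mathscr{G}}$, for any set of feasible initial
conditions and updating schedule. □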
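The difference between the equivalent channels of the two games can be seen on a very small example. The sketch below is illustrative only; the function names and the specific matrices are hypothetical. It compares $\mathbf{H}_{qq}^{H}\mathbf{R}_{-q}^{-1}\mathbf{H}_{qq}$ (original game) with $(\mathbf{H}_{qq}^{\sharp}\mathbf{R}_{-q}\mathbf{H}_{qq}^{\sharp H})^{-1}$ (modified game) for a square nonsingular channel, where they coincide, and for a tall channel with a non-white interference-plus-noise covariance, where they differ.

```python
import numpy as np

def original_channel(H, R):
    # H^H R^{-1} H: equivalent channel seen by the original best response
    return H.conj().T @ np.linalg.solve(R, H)

def modified_channel(H, R):
    # (H^# R H^{#H})^{-1}: equivalent channel seen by the modified best response
    H_pinv = np.linalg.pinv(H)
    return np.linalg.inv(H_pinv @ R @ H_pinv.conj().T)

R = np.diag([1.0, 2.0])                        # hypothetical non-white covariance
H_square = np.array([[1.0, 2.0], [0.0, 1.0]])  # square nonsingular direct channel
H_tall = np.array([[1.0], [1.0]])              # (strictly) tall direct channel

square_gap = np.linalg.norm(original_channel(H_square, R) - modified_channel(H_square, R))
tall_original = original_channel(H_tall, R)[0, 0]
tall_modified = modified_channel(H_tall, R)[0, 0]

print(square_gap, tall_original, tall_modified)
```

When the covariance is a scaled identity the two channels coincide even for tall matrices, so the difference between the games only shows up in the presence of spatially colored interference.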



VI. Numerical Results 

In this section, we provide some numerical results validating 
our theoretical findings. More specifically, we compare the 
performance of games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ in terms of conditions
guaranteeing the uniqueness of the NE, achievable rates, and 
convergence speeds of some of the proposed algorithms. 

Example #1: Probability of the uniqueness of the NE and
convergence of the asynchronous IWFA. Since the conditions
guaranteeing the uniqueness of the NE of games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$
and convergence of the asynchronous IWFA given so far
depend on the channel matrices $\{\mathbf{H}_{rq}\}_{r,q\in\Omega}$, there is a nonzero
probability that they are not satisfied for a given channel
realization drawn from a given probability space. To quantify
the adequacy of our conditions, we tested them over a set
of random channel matrices whose elements are generated as
circularly symmetric complex Gaussian random variables with
variance equal to the inverse of the square distance between
the associated transmitter-receiver links (flat-fading channel
model). We consider a MIMO multicell cellular network as
depicted in Figure 1(a), composed of 7 (regular) hexagonal
cells, sharing the same band. Hence, the transmissions from
different cells typically interfere with each other. For
simplicity of representation, we assume that in each cell
there is only one active MIMO link, corresponding to the
transmission from the base station (BS) to a mobile terminal
(MT). The overall network can be modeled as a 7-user
interference channel, where each link is a MIMO channel.
In Figure 1, we plot the probability that conditions (C1) (red
line curves) and (C2) (blue line curves) are satisfied versus
the (normalized) distance $d \in [0, 1)$ [see Figure 1(a)] between
each MT and his BS (assumed to be equal for all the MT/BS
pairs), for different values of the transmit/receive antennas.
We simulated (full rank) square, fat and tall channel matrices:
$n_{T_q} = n_{R_q} = 2$ (square marks), $n_{T_q} = 2$ and $n_{R_q} = 4$ (circle
marks), $n_{T_q} = 4$ and $n_{R_q} = 2$ (cross marks), $n_{T_q} = n_{R_q} = 4$
(star marks), $n_{T_q} = 4$ and $n_{R_q} = 5$ (triangle marks), for all
$q = 1, \ldots, Q$. According to Theorems 9 and 13, we have the
following interpretation for the probability curves plotted in
Figure 1: if the channel matrices are (full rank) square or fat,
then condition (C1) guarantees the uniqueness of the NE (and
convergence of the IWFAs) of game $\mathscr{G}$; in the case of (full
rank) tall channel matrices instead, condition (C1) applies to
game $\widetilde{\mathscr{G}}$ whereas (C2) is valid for game $\mathscr{G}$. Looking at Figure
1, the following comments are in order:

Fig. 1. (a) Multicell cellular system. (b) Probability of the uniqueness of the NE of games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ and convergence of the asynchronous IWFA; $Q = 7$, $n_{T_q} = n_{R_q} = 2$
(square marks), $n_{T_q} = 2$ and $n_{R_q} = 4$ (circle marks), $n_{T_q} = 4$ and $n_{R_q} = 2$ (cross marks), $n_{T_q} = n_{R_q} = 4$ (star marks), $n_{T_q} = 4$ and
$n_{R_q} = 5$ (triangle marks), and $P_q/\sigma_q^2 = 5$ dB, $\forall q \in \Omega$.

Fig. 2. Performance of games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ in terms of Nash equilibria: average sum-rate at the NE versus $d_{rq}/d_{qq}$ [subplot (a)] and probability
that the sum-rate exceeds a given value [subplot (b)] versus the sum-rate values (in bit/cu), for both games $\mathscr{G}$ (cross marks, dashed-dot blue line curves)
and $\widetilde{\mathscr{G}}$ (circle marks, solid red line curves); $Q = 3$, $P_r = P_q$, $\mathrm{snr}_q = P_q/\sigma_q^2 = 3$ dB, $(n_{T_q}, n_{R_q}) \in \{(2, 4), (3, 5), (4, 6)\}$ for all $r, q \in \Omega$.

- As expected, the probability of the uniqueness of the NE of
both games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ and convergence of the IWFAs increases
as each MT approaches his BS, corresponding to a decrease
of the intercell interference;

- Sufficient condition (C2) is stronger than (C1) if computed
over the same set of channels, as proved in (73) (see also
Theorem 9 and Corollary 10). For example, in the case of full
column-rank $2 \times 4$ MIMO channels, Figure 1 shows that, for
any given distance $d \in [0, 1)$, the probability that condition
(C2) holds true (circle-marks blue curve) is lower than that of
condition (C1) (circle-marks red curve), implying that game $\widetilde{\mathscr{G}}$
admits weaker uniqueness/convergence conditions than those
of the original game $\mathscr{G}$;

- Increasing the number of antennas at both the transmitter and
receiver side leads to a decrease of the uniqueness/convergence
probabilities (at least for the setup considered in the figure).
For example, in the case of square MIMO channels, the curve
associated with condition (C1) in the case of $2 \times 2$ MIMO
channels (square-marks red curve) leads to better probability
values than that obtained in the case of $4 \times 4$ MIMO channels
(star-marks red curve), for any given distance $d$. The same
result can be observed also in the case of tall channel matrices:
condition (C2) for $2 \times 4$ channel matrices (circle-marks blue
curve) has a higher probability of being satisfied than the same
condition in the case of $4 \times 5$ MIMO channels (triangle-marks
blue curve).

Example #2: Game $\mathscr{G}$ vs. game $\widetilde{\mathscr{G}}$. In Figure 2, we compare
the performance of games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ in terms of sum-rate.
Specifically, in Figure 2(a), we plot the sum-rate at the
(unique) NE of the games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ for a three-user flat
fading MIMO system as a function of the inter-pair distance
among the links, for different numbers of transmit/receive
antennas. In Figure 2(b), we plot the probability that the
sum-rate exceeds a given value as a function of the sum-rate
values, for the same systems as in Figure 2(a). The curves
in both pictures are averaged over 1000 independent channel
realizations, simulated as i.i.d. Gaussian random variables with
zero mean and unit variance. For the sake of simplicity, the
system is assumed to be symmetric, i.e., the transmitters have
the same power budget and the interference links are at the
same distance (i.e., $d_{rq} = d_{qr}$, $\forall q, r$), so that the cross channel
gains are comparable in an average sense. The path loss $\gamma$ is
assumed to be $\gamma = 2.5$.

From the figures one infers that, as for isolated single-user 
systems or multiple access/broadcast channels, also in MIMO 
interference channels, increasing the number of antennas at 
both the transmitter and the receiver side leads to better
performance. Furthermore, and more interestingly, the figures show
that the games $\mathscr{G}$ and $\widetilde{\mathscr{G}}$ have almost the same performance in
terms of sum-rate at the NE, even if in the game $\widetilde{\mathscr{G}}$, given the
strategies of the others, each player does not maximize his own
rate, as instead he does in the game $\mathscr{G}$. This is due to the fact
that the Nash equilibria of game $\mathscr{G}$ are in general not Pareto
optimal, since pursuing the maximization of each individual 
objective function in a selfish manner does not guarantee in 
general the achievement of global optimality. 

Example #3: Simultaneous MIMO IWFA vs. sequential MIMO
IWFA. In Figure 3, we compare the performance of the
sequential and simultaneous IWFAs applied to game $\mathscr{G}$ in
terms of convergence speed, for a multicell cellular system
composed of 7 (regular) hexagonal cells, as considered in
Example #1 [Figure 1(a)]. Here, we assume that in each cell
there is only one active link, corresponding to the transmission
from the base station (placed at the center of the cell) to a
mobile terminal placed at (normalized) distance $d = 0.3$ from
the corner of the cell [see Figure 1(a)]. In Figure 3, we show
the rate evolution of the links of three cells corresponding to
the sequential IWFA and simultaneous IWFA as a function of
the iteration index and averaged over 500 independent channel
realizations, each of them simulated as a FIR filter of order
$L = 6$. The curves refer to both cases of $n_{T_q} = n_{R_q} = 2$
and $n_{T_q} = n_{R_q} = 4$ for all links. To keep the figure from being
overcrowded, we report only the curves of 3 out
of 7 links. As expected, the sequential IWFA is slower than
the simultaneous IWFA, since each user is forced to wait for
all the users scheduled in advance, before updating his own
covariance matrix.

Fig. 3. Simultaneous IWFA (solid line curves) and sequential
IWFA (dashed-dot line curves) vs. iterations ($n$) [subplot (b)] for a 7
cell (downlink) wideband cellular system [subplot (a)]; Game $\mathscr{G}$,
$n_{T_q} = n_{R_q} = 2$ (plus marks) and $n_{T_q} = n_{R_q} = 4$ (circle marks),
$Q = 7$, 16 subcarriers, $P_q/\sigma_q^2 = 7$ dB, $\forall r, q \in \Omega$.

Footnote: Recall that, in the case of full column-rank channel matrices, condition
(C1) is meaningless for game $\mathscr{G}$ and can be applied only to game $\widetilde{\mathscr{G}}$.



VII. Conclusions 

In this paper we have considered a game theoretical formula- 
tion of the maximization of mutual information on each link, 
subject to power constraints, in the MIMO Gaussian interfer- 
ence channel. We have provided a complete characterization of 
the game, by deriving sufficient conditions guaranteeing both 
uniqueness of the NE and global convergence of the proposed 
totally asynchronous MIMO IWFA. Differently from current 
works in the literature, our results do not make any assumption 
on the structure of the channel matrices, but can be applied 
to arbitrary MIMO interference systems. Surprisingly, the 
uniqueness and convergence conditions in the case of (strictly) 
tall (and/or singular) channel matrices need to be stronger
than those required in the case of full row-rank channel
matrices. Finally, we have proposed an alternative algorithm, 
derived from a modified game, with weaker convergence 
conditions and virtually the same performance as the MIMO 
asynchronous IWFA. 
Appendix A
Proof of Lemma 7

We first introduce the following intermediate result and then
proceed to the proof of Lemma 7.

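Lemma 14: Let $X_0 \in \mathbb{C}^{n \times n}$ be a Hermitian matrix with
eigendecomposition $X_0 = U_0 D_0 U_0^H$, and let $\mathcal{Q}$ be the convex
set defined as

$$\mathcal{Q} = \left\{ Q \in \mathbb{C}^{n \times n} : Q \succeq 0, \ \mathrm{Tr}\{Q\} = P_T \right\}. \qquad (82)$$

The matrix projection of $X_0$ with respect to the Frobenius
norm onto $\mathcal{Q}$, denoted by $[X_0]_{\mathcal{Q}}$, is by definition the solution
to the following convex optimization problem:

$$\begin{array}{ll} \underset{X}{\text{minimize}} & \| X - X_0 \|_F^2 \\ \text{subject to} & X \in \mathcal{Q}, \end{array} \qquad (83)$$

and takes the following form:

$$[X_0]_{\mathcal{Q}} = U_0 \left( D_0 - \mu I \right)^{+} U_0^H, \qquad (84)$$

where $\mu$ satisfies the constraint $\mathrm{Tr}\{(D_0 - \mu I)^{+}\} = P_T$.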

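Proof: Using $X_0 = U_0 D_0 U_0^H$, we rewrite the objective
function of (83) as

$$\| X - X_0 \|_F^2 = \| \widetilde{X} - D_0 \|_F^2, \qquad (85)$$

where we used the unitary invariance of the Frobenius norm
and $\widetilde{X}$ is defined as $\widetilde{X} \triangleq U_0^H X U_0$. Since

$$\| \widetilde{X} - D_0 \|_F^2 \geq \| \mathrm{Diag}(\widetilde{X}) - D_0 \|_F^2, \qquad (86)$$

with equality if and only if $\widetilde{X}$ is diagonal, and the power
constraint $\mathrm{Tr}\{X\} = \mathrm{Tr}\{\widetilde{X}\} = P_T$ depends only on the diagonal
elements of $\widetilde{X}$, it follows that the optimal $\widetilde{X}$ must be diagonal:
$\widetilde{X} = D$. Denoting by $d_k = [D]_{kk}$ and $d_{0,k} = [D_0]_{kk}$, with
$k = 1, \ldots, n$, the diagonal entries of the matrices $D$ and $D_0$,
respectively, the matrix-valued problem (83) reduces to the
following vector convex optimization problem:

$$\begin{array}{ll} \underset{d \geq 0}{\text{minimize}} & \sum_{k=1}^{n} (d_k - d_{0,k})^2 \\ \text{subject to} & \sum_{k=1}^{n} d_k = P_T, \end{array} \qquad (87)$$

whose unique solution $\{d_k^{\star}\}$ is given by [15, Lemma 1]:
$d_k^{\star} = (d_{0,k} - \mu)^{+}$, with $k = 1, \ldots, n$, where $\mu$ is chosen to satisfy
$\sum_{k=1}^{n} (d_{0,k} - \mu)^{+} = P_T$. ■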
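
The projection in (84) can be checked numerically. The following is a minimal Python/NumPy sketch, not taken from the paper: the function names `shift_clip` and `project_psd_trace` are hypothetical, and the water level $\mu$ is found with the standard sort-based simplex-projection rule, which is one possible way (an assumption here) of solving $\sum_k (d_{0,k} - \mu)^{+} = P_T$.

```python
import numpy as np

def shift_clip(v, total):
    """Return (v - mu)^+ with mu chosen so that the entries sum to `total`."""
    u = np.sort(v)[::-1]                      # entries in decreasing order
    css = np.cumsum(u) - total
    ks = np.arange(1, v.size + 1)
    k = ks[u - css / ks > 0][-1]              # number of strictly positive entries
    mu = css[k - 1] / k                       # water level
    return np.maximum(v - mu, 0.0)

def project_psd_trace(x0, total):
    """Frobenius projection of Hermitian x0 onto {Q >= 0, Tr(Q) = total}, as in (84)."""
    d0, u0 = np.linalg.eigh(x0)               # x0 = u0 diag(d0) u0^H
    return (u0 * shift_clip(d0, total)) @ u0.conj().T

# Illustrative data (arbitrary): a random Hermitian, generally indefinite, matrix.
rng = np.random.default_rng(0)
b = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
x0 = 0.5 * (b + b.conj().T)
q = project_psd_trace(x0, total=2.0)
print(np.trace(q).real, np.linalg.eigvalsh(q).min())
```

The sketch works on the eigenvalues only, mirroring the reduction from the matrix problem (83) to the vector problem (87).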



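written as [26] (we omit the dependence on $Q_{-q}$ for the sake
of notation):

$$\left( H_{qq}^H R_{-q}^{-1} H_{qq} \right)^{\sharp} = U_q D_q^{-1} U_q^H. \qquad (89)$$

Using (89) and introducing the $n_{T_q} \times n_{T_q}$ unitary matrix
$\widetilde{U}_q \triangleq (U_q, U_q^{\perp})$, where $U_q^{\perp} \in \mathbb{C}^{n_{T_q} \times (n_{T_q} - r_q)}$ is such that
$U_q^H U_q^{\perp} = 0$, we have, for any given $c_q \in \mathbb{R}$,

$$\left( H_{qq}^H R_{-q}^{-1} H_{qq} \right)^{\sharp} + c_q P_{\mathcal{N}(H_{qq})} = \widetilde{U}_q \begin{pmatrix} D_q^{-1} & 0 \\ 0 & c_q I_{n_{T_q} - r_q} \end{pmatrix} \widetilde{U}_q^H, \qquad (90)$$

where $I_{n_{T_q} - r_q}$ denotes the $(n_{T_q} - r_q) \times (n_{T_q} - r_q)$ identity
matrix and the equality in (90) comes from $P_{\mathcal{N}(H_{qq})} = U_q^{\perp} U_q^{\perp H}$,
implied from the fact that $\mathcal{N}(H_{qq}^H R_{-q}^{-1} H_{qq}) = \mathcal{N}(H_{qq})$ [recall
that $R_{-q}$ is positive definite for any $Q_{-q} \in \mathcal{Q}_{-q}$]. It follows
from Corollary 15 that, for any given $c_q \in \mathbb{R}_{++}$,

$$\left[ -\left( H_{qq}^H R_{-q}^{-1} H_{qq} \right)^{\sharp} - c_q P_{\mathcal{N}(H_{qq})} \right]_{\mathcal{Q}_q} = \widetilde{U}_q \begin{pmatrix} \left( \mu_q I_{r_q} - D_q^{-1} \right)^{+} & 0 \\ 0 & (\mu_q - c_q)^{+} I_{n_{T_q} - r_q} \end{pmatrix} \widetilde{U}_q^H, \qquad (91)$$

where $\mu_q$ is chosen to satisfy the constraint
$\mathrm{Tr}\left( \left( \mu_q I - \mathrm{Diag}(D_q^{-1}, c_q I_{n_{T_q} - r_q}) \right)^{+} \right) = P_q$. Since
$P_q < \infty$, it is always possible to find a sufficiently large
positive constant $c_q < \infty$ such that $(\mu_q - c_q)^{+} = 0$, and thus
the RHS of (91) becomes

$$\left[ -\left( H_{qq}^H R_{-q}^{-1} H_{qq} \right)^{\sharp} - c_q P_{\mathcal{N}(H_{qq})} \right]_{\mathcal{Q}_q} = U_q \left( \mu_q I_{r_q} - D_q^{-1} \right)^{+} U_q^H, \qquad (92)$$

which coincides with the desired solution in (9). In words: for
any given finite power budget $P_q$, it is always possible to find
a positive constant $c_q$ sufficiently large so that the waterfilling
solution (9) does not allocate any power along the eigenvectors
$U_q^{\perp}$. Exploring the structure of the power allocation in (91),
it is not difficult to show that this happens if $c_q > c_q(Q_{-q})$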



^ (sufficient condition). A (finite) 
is, e.g.. 



upper bound of Cg(Q_g) independent on Q_g 



Cq{Q-q) < Pq 

where Pmax 



Amin(H^lHgg) 



< OO, 

(93) 



= maxq Pq . 
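
As an illustration of the equivalence between the waterfilling solution and the projection in (91)-(92), the following Python/NumPy sketch compares the two on a randomly generated rank-deficient channel. It is only a numerical sanity check under stated assumptions: the dimensions, the random data, the helper names, and the choice `c = power + max_i [D^{-1}]_ii + 1` (a deliberately conservative constant, not the bound of the paper) are all hypothetical.

```python
import numpy as np

def shift_clip(v, total):
    """Return (v - theta)^+ with theta chosen so that the entries sum to `total`."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u) - total
    ks = np.arange(1, v.size + 1)
    k = ks[u - css / ks > 0][-1]
    return np.maximum(v - css[k - 1] / k, 0.0)

def project_psd_trace(x0, total):
    """Frobenius projection of Hermitian x0 onto {Q >= 0, Tr(Q) = total}."""
    d0, u0 = np.linalg.eigh(x0)
    return (u0 * shift_clip(d0, total)) @ u0.conj().T

rng = np.random.default_rng(0)
def crandn(*shape):
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

n_r, n_t, rank, power = 5, 4, 2, 3.0
h = crandn(n_r, rank) @ crandn(rank, n_t)         # rank-deficient direct channel
b = crandn(n_r, n_r)
r = np.eye(n_r) + b @ b.conj().T                  # positive definite interference-plus-noise
a = h.conj().T @ np.linalg.solve(r, h)
a = 0.5 * (a + a.conj().T)                        # enforce exact Hermitian symmetry

d, u = np.linalg.eigh(a)
keep = d > 1e-9 * d.max()                         # eigenpairs spanning the range of a
d, u = d[keep], u[:, keep]
inv = 1.0 / d

# Direct waterfilling: powers (mu - 1/d_i)^+ along the columns of u.
q_wf = (u * shift_clip(-inv, power)) @ u.conj().T

# Projection form: -(pseudoinverse + c * projector onto the null space).
p_null = np.eye(n_t) - u @ u.conj().T
c = power + inv.max() + 1.0                       # assumed "large enough" constant
q_proj = project_psd_trace(-((u * inv) @ u.conj().T + c * p_null), power)

gap = np.linalg.norm(q_wf - q_proj)
print(gap)
```

With a constant `c` that is too small, the projection would place power on the null space of the channel and the two matrices would differ, which is the role played by the sufficient condition on $c_q$.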



Corollary 15: The Euclidean projection of $-X_0$ onto the
set $\mathcal{Q}$ defined in (82) takes the following form:

$$[-X_0]_{\mathcal{Q}} = U_0 \left( \mu I - D_0 \right)^{+} U_0^H, \qquad (88)$$

where $\mu$ is chosen to satisfy the constraint $\mathrm{Tr}\{(\mu I - D_0)^{+}\} = P_T$. □

Proof of Lemma 7: Given $q \in \Omega$ and $Q_{-q} \in \mathcal{Q}_{-q}$, using the
eigendecomposition of $H_{qq}^H R_{-q}^{-1}(Q_{-q}) H_{qq}$ as given in (7), the
Moore-Penrose pseudoinverse of $H_{qq}^H R_{-q}^{-1}(Q_{-q}) H_{qq}$ can be



Appendix B
Proof of Lemma 6 and Theorem 7

In this section we prove Lemma 6 and provide some
intermediate results used in the proof of Theorem 7 given in
Section III-B3.

We go through the complex differential of the (complex
valued) functions using the approach in [32], meaning that
we treat the complex differential of the complex variable
and its complex conjugate as independent. In fact, it follows
from the definition of complex derivative that, although the
complex variables $z \in \mathbb{C}$ (or $Z \in \mathbb{C}^{m \times n}$) and $z^*$ (or $Z^*$)
are related, their derivatives (or differentials) are independent
(or linearly independent in the sense of [32, Lemma 1]). This
approach simplifies the derivation of many complex derivative
expressions.

Definition 1: Given $(\mathcal{D}_1, \mathcal{D}_2) \subseteq \mathbb{C}^{m \times n} \times \mathbb{C}^{m \times n}$, let
$F(X_1, X_2) : \mathcal{D}_1 \times \mathcal{D}_2 \mapsto \mathbb{C}^{p \times q}$ be a complex matrix-valued
function. Let $(X_1, X_2)$ be an interior point in $\mathcal{D}_1 \times \mathcal{D}_2$, and let
$(dX_1, dX_2)$ be such that $(X_1 + dX_1, X_2 + dX_2)$ lies in $\mathcal{D}_1 \times \mathcal{D}_2$.
Then, $F(X_1, X_2)$ is differentiable with respect to its first and
second argument at $(X_1, X_2)$ if

$$F(X_1 + dX_1, X_2 + dX_2) = F(X_1, X_2) + dF(X_1, X_2; dX_1, dX_2) + o(X_1, X_2, dX_1, dX_2), \qquad (94)$$

where $dF(X_1, X_2; dX_1, dX_2)$ is a linear function in
$(dX_1, dX_2)$ and $o(X_1, X_2, dX_1, dX_2)$ contains higher order
terms of $(dX_1, dX_2)$. □

Definition 1 is the generalization of the analogous definition
given in [31, Ch. 5.4] and [33] for the real case to complex
matrix-valued functions (see also [32]). Building on this
definition, similarly to the real case, one can define the Jacobian
matrices associated with a differentiable complex matrix-valued
function. As in [32], we will consider functions $F(X, X^*)$ that
depend on both $X$ and $X^*$.

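Definition 2: Given $\mathcal{D} \subseteq \mathbb{C}^{m \times n}$, let $F(X, X^*) : \mathcal{D} \times \mathcal{D} \mapsto \mathbb{C}^{p \times q}$
be a complex matrix-valued function, assumed
to be differentiable on the interior of $\mathcal{D} \times \mathcal{D}$. The $pq \times mn$
Jacobian matrices of $F$ with respect to $X$ and $X^*$, denoted by
$D_X F(X, X^*)$ and $D_{X^*} F(X, X^*)$, respectively, are implicitly
defined by the following differential expression [32]:

$$d\,\mathrm{vec}(F) \triangleq (D_X F)\, d\,\mathrm{vec}(X) + (D_{X^*} F)\, d\,\mathrm{vec}(X^*). \qquad (95)$$

□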

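Using the notion of partial derivative, the Jacobian matrices
$D_X F(X, X^*)$ and $D_{X^*} F(X, X^*)$ in Definition 2 can be
written as

$$D_X F(X, X^*) = \frac{\partial\, \mathrm{vec}(F(X, X^*))}{\partial\, \mathrm{vec}^T(X)}, \qquad D_{X^*} F(X, X^*) = \frac{\partial\, \mathrm{vec}(F(X, X^*))}{\partial\, \mathrm{vec}^T(X^*)}. \qquad (96)$$

In [32, Lemma 1], the authors proved that, as for the real
case [31, Ch. 5, Th. 3], the representation in the form (95) is
unique, implying that (95) provides the identification rule for
computing $D_X F(X, X^*)$ and $D_{X^*} F(X, X^*)$. In fact, given
$F(X, X^*)$, one can obtain the Jacobian matrices from the
differential $dF$, using the following three-step procedure: 1)
compute the differential of $F(X, X^*)$; 2) vectorize to obtain
$d\,\mathrm{vec}(F) = A(X, X^*)\, d\,\mathrm{vec}(X) + B(X, X^*)\, d\,\mathrm{vec}(X^*)$;
and 3) conclude that $D_X F(X, X^*) = A(X, X^*)$ and
$D_{X^*} F(X, X^*) = B(X, X^*)$.

Footnote (on the definition of complex derivative): Given $f : \mathbb{C} \mapsto \mathbb{C}$ and
$x_0 = x_R + j x_I$, with $x_R, x_I \in \mathbb{R}$, the partial derivatives (or Wirtinger
derivatives [33]) $\frac{\partial}{\partial x} f(x_0)$ and $\frac{\partial}{\partial x^*} f(x_0)$ are defined as follows:
$\frac{\partial}{\partial x} f(x_0) \triangleq \frac{1}{2} \left( \frac{\partial}{\partial x_R} f(x) - j \frac{\partial}{\partial x_I} f(x) \right) \Big|_{x = x_0}$
and $\frac{\partial}{\partial x^*} f(x_0) \triangleq \frac{1}{2} \left( \frac{\partial}{\partial x_R} f(x) + j \frac{\partial}{\partial x_I} f(x) \right) \Big|_{x = x_0}$.
Observe that in $\frac{\partial}{\partial x} f(x)$ and $\frac{\partial}{\partial x^*} f(x)$ the variables $x$ and $x^*$, respectively, are treated as
independent variables.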
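
The treatment of $x$ and $x^*$ as independent variables can be illustrated on a scalar example. The sketch below (Python/NumPy, with the hypothetical helper `wirtinger` and the arbitrary test function $f(x, x^*) = x^2 x^*$) approximates the two Wirtinger derivatives by central differences along the real and imaginary axes and compares them with the values obtained by differentiating formally with respect to $x$ and $x^*$.

```python
import numpy as np

def wirtinger(f, x0, h=1e-6):
    """Central-difference estimates of (df/dx, df/dx*) at the complex point x0."""
    d_re = (f(x0 + h) - f(x0 - h)) / (2 * h)             # partial w.r.t. real part
    d_im = (f(x0 + 1j * h) - f(x0 - 1j * h)) / (2 * h)   # partial w.r.t. imaginary part
    return 0.5 * (d_re - 1j * d_im), 0.5 * (d_re + 1j * d_im)

def f(x):
    return x**2 * np.conj(x)          # f(x, x*) = x^2 x*

x0 = 0.7 - 1.3j
df_dx, df_dxc = wirtinger(f, x0)

# Formal derivatives, treating x and x* as independent variables.
expected_dx = 2 * x0 * np.conj(x0)    # d/dx  (x^2 x*) = 2 x x*
expected_dxc = x0**2                  # d/dx* (x^2 x*) = x^2
print(abs(df_dx - expected_dx), abs(df_dxc - expected_dxc))
```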

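Using the above definitions, we can now prove the following.

Lemma 16: Let $F(X) : \mathcal{D} \subseteq \mathbb{C}^{m \times n} \mapsto \mathbb{C}^{p \times q}$ be a complex
matrix-valued function, assumed to be differentiable at an
interior point $X_0$ in the set $\mathcal{D}$, with Jacobian matrix at $X_0$ denoted
by $D_X F(X_0)$. Given $x \triangleq [\mathrm{vec}^T(\mathrm{Re}\{X\}), \mathrm{vec}^T(\mathrm{Im}\{X\})]^T$,
let $f(x) : \mathbb{R}^{2mn} \mapsto \mathbb{R}^{2pq}$ be the real vector-valued function
defined as $f(x) = [\mathrm{vec}^T(\mathrm{Re}\{F(X)\}), \mathrm{vec}^T(\mathrm{Im}\{F(X)\})]^T$.
Then, $f$ is differentiable at $x_0$, with Jacobian matrix given by

$$D_x f(x_0) \triangleq \frac{\partial f(x)}{\partial x^T} \bigg|_{x = x_0} = \begin{bmatrix} \mathrm{Re}\{D_X F(X_0)\} & -\mathrm{Im}\{D_X F(X_0)\} \\ \mathrm{Im}\{D_X F(X_0)\} & \mathrm{Re}\{D_X F(X_0)\} \end{bmatrix}. \qquad (97)$$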

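Proof: The differentiability of $f$ follows directly from that
of $F$. Given $X = X_R + j X_I$, with $X_R, X_I \in \mathbb{R}^{m \times n}$, the
Jacobian matrix $D_x f(x_0)$ of $f$ at $x_0$ is [see (96)]:

$$D_x f(x_0) = \begin{bmatrix} D_{X_R} \mathrm{Re}\{F(X_0)\} & D_{X_I} \mathrm{Re}\{F(X_0)\} \\ D_{X_R} \mathrm{Im}\{F(X_0)\} & D_{X_I} \mathrm{Im}\{F(X_0)\} \end{bmatrix}. \qquad (98)$$

We compute now the four matrix blocks in $D_x f(x_0)$ above.
Invoking the chain rule for Jacobian matrices [32, Theorem 1]
and using the following simple rules $D_{X_R} X = D_{X_R} X^* = I_{mn}$,
$D_{X_I} X = -D_{X_I} X^* = j I_{mn}$, we obtain:

$$D_{X_R} \mathrm{Re}\{F(X)\} = 2\, \mathrm{Re}\{D_X \mathrm{Re}\{F(X)\}\}, \qquad (99)$$
$$D_{X_I} \mathrm{Re}\{F(X)\} = -2\, \mathrm{Im}\{D_X \mathrm{Re}\{F(X)\}\}, \qquad (100)$$
$$D_{X_R} \mathrm{Im}\{F(X)\} = 2\, \mathrm{Re}\{D_X \mathrm{Im}\{F(X)\}\}, \qquad (101)$$
$$D_{X_I} \mathrm{Im}\{F(X)\} = -2\, \mathrm{Im}\{D_X \mathrm{Im}\{F(X)\}\}. \qquad (102)$$

Given (99)-(102), to complete the proof, we need to compute
$D_X \mathrm{Re}\{F(X)\}$ and $D_X \mathrm{Im}\{F(X)\}$. Invoking the identification
rule (95) in Definition 2, we have

$$d\,\mathrm{vec}(\mathrm{Re}\{F\}) = \tfrac{1}{2}\, d\,\mathrm{vec}(F + F^*) = \tfrac{1}{2} (D_X F)\, d\,\mathrm{vec}(X) + \tfrac{1}{2} (D_X F)^*\, d\,\mathrm{vec}(X^*) = (D_X \mathrm{Re}\{F(X)\})\, d\,\mathrm{vec}(X) + (D_{X^*} \mathrm{Re}\{F(X)\})\, d\,\mathrm{vec}(X^*), \qquad (103)$$

where we used the fact that $F = F(X)$ depends only on $X$
(and not on $X^*$), and

$$D_X \mathrm{Re}\{F(X)\} = \tfrac{1}{2} D_X F, \qquad (104)$$
$$D_{X^*} \mathrm{Re}\{F(X)\} = \tfrac{1}{2} (D_X F)^*. \qquad (105)$$

Similarly, we can compute

$$D_X \mathrm{Im}\{F(X)\} = \tfrac{1}{2j} D_X F, \qquad (106)$$
$$D_{X^*} \mathrm{Im}\{F(X)\} = -\tfrac{1}{2j} (D_X F)^*. \qquad (107)$$

Introducing (104)-(107) in (99)-(102) and using (98) we obtain
the desired expression for $D_x f(x)$ as given in (97). This
completes the proof. ■

[Displayed equations (115)-(118), referenced in the proof of Lemma 6:]

$$\| D_x f(z_t)(y - x) \|_2 = \left\| \begin{bmatrix} \mathrm{Re}\{D_X F(Z_t)\} & -\mathrm{Im}\{D_X F(Z_t)\} \\ \mathrm{Im}\{D_X F(Z_t)\} & \mathrm{Re}\{D_X F(Z_t)\} \end{bmatrix} \begin{bmatrix} \mathrm{vec}(\mathrm{Re}\{Y - X\}) \\ \mathrm{vec}(\mathrm{Im}\{Y - X\}) \end{bmatrix} \right\|_2 \qquad (115)$$
$$= \left\| \left[ \mathrm{Re}\{D_X F(Z_t)\} + j\, \mathrm{Im}\{D_X F(Z_t)\} \right] \left[ \mathrm{vec}(\mathrm{Re}\{Y - X\}) + j\, \mathrm{vec}(\mathrm{Im}\{Y - X\}) \right] \right\|_2 \qquad (116)$$
$$= \| D_X F(Z_t)\, \mathrm{vec}(Y - X) \|_2 \qquad (117)$$
$$\leq \| D_X F(Z_t) \|_{2,\mathrm{mat}}\, \| Y - X \|_F. \qquad (118)$$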
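
The block structure in (97) can be verified numerically on a simple case. The sketch below uses the linear map $F(X) = A X B$, chosen here only for illustration because its Jacobian is known in closed form, $D_X F = B^T \otimes A$ (from $\mathrm{vec}(AXB) = (B^T \otimes A)\mathrm{vec}(X)$). It builds the real Jacobian of (97) and checks that it reproduces the stacked real and imaginary parts of $F(X)$, and that the real and complex Jacobians have the same spectral norm, which is the fact used in passing from (115) to (117). All sizes and names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)

def crandn(*shape):
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

def vec(m):
    """Column-major vectorization."""
    return m.reshape(-1, order="F")

a, b = crandn(3, 2), crandn(4, 5)       # F(X) = A X B maps C^{2x4} to C^{3x5}
x = crandn(2, 4)

jac = np.kron(b.T, a)                   # complex Jacobian D_X F
jac_real = np.block([[jac.real, -jac.imag],
                     [jac.imag,  jac.real]])   # real Jacobian, structure of (97)

fx = a @ x @ b
x_real = np.concatenate([vec(x.real), vec(x.imag)])
f_real = np.concatenate([vec(fx.real), vec(fx.imag)])

# For a linear map the Jacobian reproduces the function exactly.
residual = np.linalg.norm(jac_real @ x_real - f_real)
norm_gap = abs(np.linalg.norm(jac_real, 2) - np.linalg.norm(jac, 2))
print(residual, norm_gap)
```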

We have now all the results we need to prove the mean-value
theorem for complex matrix-valued functions.

Proof of Lemma 6: We organize the proof in two steps. First,
we apply the classical mean-value theorem given in (44) [34,
Th. 5.10] to a proper real scalar function $\phi(t)$, related to the
original complex matrix-valued function $F(X)$. Then, using
Lemma 16, we convert the obtained result into the form (46)-(47).

Step 1: Given $X$ and $F(X)$, let $x = [\mathrm{vec}^T(\mathrm{Re}\{X\}), \mathrm{vec}^T(\mathrm{Im}\{X\})]^T$
and $f(x) : \mathbb{R}^{2mn} \mapsto \mathbb{R}^{2pq}$
be the real vector-valued function defined as
$f(x) \triangleq [\mathrm{vec}^T(\mathrm{Re}\{F(X)\}), \mathrm{vec}^T(\mathrm{Im}\{F(X)\})]^T$. It follows
directly from the assumptions on $F$ that $f$ is continuous
(and differentiable) in (the interior of) its domain. Given
$X, Y \in \mathcal{D}$, with $X \neq Y$, we introduce the real scalar function
$\phi : [0, 1] \mapsto \mathbb{R}$, defined as

$$\phi(t) \triangleq \frac{\langle f(y) - f(x),\ f(ty + (1 - t)x) \rangle}{\| f(y) - f(x) \|_2}, \qquad t \in [0, 1], \qquad (108)$$

where $\langle x, y \rangle = x^T y$ and $\| \cdot \|_2$ is the Euclidean norm.
Observe that the function $\phi(t)$ is well-defined because the set
$\mathcal{D}$ is convex. Furthermore, $\phi$ is continuous on $[0, 1]$ and
differentiable on $(0, 1)$, as implied by the differentiability of $f$
and of the scalar product. The first derivative of $\phi$ is equal to:

$$\frac{d\phi(t)}{dt} = \frac{\langle f(y) - f(x),\ D_x f(ty + (1 - t)x)\,(y - x) \rangle}{\| f(y) - f(x) \|_2}, \qquad (109)$$

where in (109) we used the chain rule (see, e.g., [31, Ch. 5,
Theorem 8]).

Function $\phi$ satisfies the conditions of the mean value
theorem for real scalar functions [34, Th. 5.10], meaning that
[see (44)]

$$\phi(1) - \phi(0) = \frac{d\phi(z)}{dz} \bigg|_{z = t}, \qquad \text{for some } t \in (0, 1). \qquad (110)$$

Using (108) and

$$\phi(1) = \frac{\langle f(y) - f(x),\ f(y) \rangle}{\| f(y) - f(x) \|_2}, \qquad (111)$$
$$\phi(0) = \frac{\langle f(y) - f(x),\ f(x) \rangle}{\| f(y) - f(x) \|_2}, \qquad (112)$$

the LHS in (110) becomes $\phi(1) - \phi(0) = \| f(y) - f(x) \|_2$,
whereas the RHS becomes

$$\frac{d\phi(z)}{dz} \bigg|_{z = t} = \frac{\langle f(y) - f(x),\ D_x f(ty + (1 - t)x)\,(y - x) \rangle}{\| f(y) - f(x) \|_2} \leq \| D_x f(ty + (1 - t)x)\,(y - x) \|_2, \qquad (113)$$

which leads to

$$\| f(y) - f(x) \|_2 \leq \| D_x f(ty + (1 - t)x)\,(y - x) \|_2, \qquad (114)$$

for some $t \in (0, 1)$.

Step 2: To complete the proof, we need to show that (46)
comes from (114). To this end, we use Lemma 16 as
detailed next. Since $\| f(y) - f(x) \|_2 = \| F(Y) - F(X) \|_F$,
we focus only on the RHS of (114). For the sake of
notation, we introduce $Z_t = tY + (1 - t)X$ and
$z_t = [\mathrm{vec}^T(\mathrm{Re}\{Z_t\}), \mathrm{vec}^T(\mathrm{Im}\{Z_t\})]^T$, so that the RHS of (114)
can be written as in equation (115), where we used Lemma 16.
The desired expressions in (46)-(47) follow directly from (114)
and (115)-(118). This completes the proof. ■

We provide also the following lemma that is used in the
proof of Theorem 7.

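Lemma 17: Let $F(X) = \left( H^H R^{-1}(X) H \right)^{-1}$, with
$R(X) \triangleq C + \sum_{k=1}^{K} T_k X_k T_k^H$, where $H \in \mathbb{C}^{m \times n}$,
$C \in \mathbb{C}^{m \times m}$, $T_k \in \mathbb{C}^{m \times n_k}$, and $X = [X_1, \ldots, X_K]$ with each
$X_k \in \mathbb{C}^{n_k \times n_k}$, such that $\mathrm{rank}(H) = n$, $\mathrm{rank}(C) = m$, and
$R(X)$ is nonsingular. Then $F(X)$ is differentiable at $X$ and

$$D_X F(X) = \left[ G_1^*(X) \otimes G_1(X), \ \ldots, \ G_K^*(X) \otimes G_K(X) \right], \qquad (119)$$

where

$$G_k(X) = \left( H^H R^{-1}(X) H \right)^{-1} H^H R^{-1}(X)\, T_k. \qquad (120)$$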



Proof: We compute first the differential of $F(X)$ and then,
invoking the identification rule in (95), we identify the Jacobian
matrix. Function $F$ is differentiable at $X$, with differential
given by

$$dF(X) = -\left( H^H R^{-1}(X) H \right)^{-1} H^H\, d\!\left( R^{-1}(X) \right) H \left( H^H R^{-1}(X) H \right)^{-1} = \sum_{k=1}^{K} G_k(X)\, dX_k\, G_k^H(X), \qquad (121)$$

where we used $d(X^{-1}) = -X^{-1}\, dX\, X^{-1}$ and
(120). By vectorizing $dF(X)$ and using $\mathrm{vec}(ABC) = (C^T \otimes A)\, \mathrm{vec}(B)$
(see, e.g., [31], [26]) we obtain

$$d\,\mathrm{vec}\, F(X) = \left[ G_1^*(X) \otimes G_1(X), \ \ldots, \ G_K^*(X) \otimes G_K(X) \right] d\,\mathrm{vec}\, X, \qquad (122)$$

which, using the identification rule in (95), leads to the desired
structure of the Jacobian $D_X F(X)$ as given in (119). ■
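
A finite-difference check of (119)-(121) for a single block ($K = 1$) can be sketched as follows. This is an illustrative test, not part of the paper: the dimensions, the random matrices, and the names are assumptions, and the evaluation point `x` and the matrix `c` are taken Hermitian so that $R(X)$ is Hermitian, which is the setting in which the second factor of the differential equals $G^H$.

```python
import numpy as np

rng = np.random.default_rng(1)

def crandn(*shape):
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

m, n, nk = 4, 3, 2
h = crandn(m, n)                        # full column rank with probability one
t = crandn(m, nk)
b = crandn(m, m)
c = np.eye(m) + b @ b.conj().T          # Hermitian positive definite
y = crandn(nk, nk)
x = y @ y.conj().T                      # Hermitian evaluation point

def f_map(x_arg):
    """F(X) = (H^H R(X)^{-1} H)^{-1} with R(X) = C + T X T^H."""
    r = c + t @ x_arg @ t.conj().T
    return np.linalg.inv(h.conj().T @ np.linalg.solve(r, h))

r0 = c + t @ x @ t.conj().T
g = f_map(x) @ h.conj().T @ np.linalg.inv(r0) @ t    # G in (120), size n x nk
jac = np.kron(g.conj(), g)                           # Jacobian in (119), K = 1

dx = 1e-6 * crandn(nk, nk)                           # small perturbation of X
lhs = (f_map(x + dx) - f_map(x)).reshape(-1, order="F")
rhs = jac @ dx.reshape(-1, order="F")
rel_err = np.linalg.norm(lhs - rhs) / np.linalg.norm(rhs)
print(rel_err)
```

The relative error is expected to scale with the size of the perturbation, since the neglected terms are of second order in `dx`.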

Appendix C
Proof of Theorem 9

The existence of a NE of game $\mathcal{G}$ for any set of channel
matrices and power budget follows directly from [16, Th.
6] (i.e., quasiconcave payoff functions and convex compact
strategy sets). As far as the uniqueness of the NE is concerned,
a sufficient condition for the uniqueness of the equilibrium
is that the waterfilling mapping in (9) be a contraction with
respect to some norm [23, Prop. 1.1(a)]. Hence, the sufficiency
of (C1) in the case of full row-rank and full column-rank
channel matrices $\{H_{qq}\}_{q \in \Omega}$ follows from Theorems 5 and
|7] respectively. Finally, the equivalences ||S||^jj,at < 1 ^ 
p(S) < 1 and ||S(^*)||-,^,t < 1 ^ p(S(^*)) < 1 can be 
proved using [23 Cor. 6.1] (cf. HH Th. 6]). 

We focus now on the more general case in which the
channel matrices $H_{qq}$ may be rank deficient and prove that
condition (C1) in Theorem 9 is still sufficient to guarantee
the uniqueness of the NE. Let 17 C O be the subset of 
containing the users' indexes q such that rank(Hgq) = < 
min(nT, For each g e il, given the SVD of H,, = 
Ug.iS^^V^i, where U<,,i e C"«<!^'-', Vg,i G C"^'!^''' 
are semi-unitary matrices and G C' 



where R_ = R_g(Q_ 



= R-nn, + E Hr-jQrH.^^; ^nd 

coincides with 3,q defined in ( 11241 ) if g G il, whereas 
it coincides with defined in (|6]l if q G ri\ri. It is 
straightforward to see that conditions for the uniqueness of 
the NE of game guarantee also the uniqueness of the NE 
of ^. _ _ 

Observe that, in the game 'S , all channel matrices Hg^ are 
full column-rank matrices. We can thus use Theorem [T] and 
obtain the following sufficient condition for the uniqueness of 
the NE of both games ^ and 



p(S(^*))<l, 



(127) 



is a diagonal 

matrices with positive entries, let PA/'(H,g)^ = ^q-^^\ 
be the orthogonal projection onto the subspace orthogonal 
to the null-space of Hg^. Since < mm{nT , nn ), it 
is not difficult to show that the best-response of each user 
Q* = WFg(Q_g)— the solution of the rate-maximization 8 
problem in (© for a given Q_, G ^_g-will be orthogonal \,{Grq) < A,(Vf G,,V,) < A,+„^„_,„ (G,,), (129) 

to the null space of fiqq, whatever Q q G is, implying 

It follows then that the best Vi = 1, 2 



with 

p fv«Hf^P;!;H«f H«^P*^H,,V,) ,if r G H, r ^ q, 

p [ii^qP*" H«f H« ,P*^H,,) , if r G f7 \ H, r ^ 

if r = q, 

(128) 

with 3^* defined in Theorem [T] [recall that 
P*g — I if rank(Hgq) = nji^, which in- 
cludes the square nonsingular case], where in 
(EHJ we used p(HfgPfXfHj,P.,H,,) 

p (V,f Hf^Pf,H«f H«^P,,,H,.,VO. 

We show now that condition (IClb in Theorem|9]is sufficient 
for (1127b . Introducing the Hermitian positive semidefinite 
matrix G,, ^ Hf^Pf^H«f H«gP,,H,, and invoking the 
Poincar separation theorem [26. Cor. 4.3.16], we have for any 



iven Vrq G 



- PA/'(H„)iQgPA^(H2_ 

response of each user 5 G il belongs to the following class of 
matrices: 

Q9=V,,i^„Vfi, (123) 



with 



G Bq = {X G C^'^"' : X )^ 0, Tr(X) = Pq] . 

(124) 

Using ( 1123b and introducing the (possibly) lower-dimensional 
covariance matrices {Qgjgesi ™d the modified channel ma- 
trices {Hrqj^^gj^, defined respectively as 



' I Qgi Otherwise, 



(125) 



A f H^gVr,!, if r G 
1 Urn, Otherwise, 



game can be recast in the following lower-dimensional game 
defined as 

maximize loa;det ( I + H„„R_„HooQn ) 

(^) : Q, _ _^ « V Vg G 17, 

subject to Q G 

(126) 



jTr, where the eigenvalues are arranged 

in increasing order It follows from ( |129b that p (V^G^gV^) 
< p [Grq] , for all r G fi, g G f7 with r ^ q and fixed 
Prq e ^, and thus S(^*) < Si^*), where S is defined 
in (dH). The sufficiency of dcD for (TTH follows from: < 
S < S =^ p(S) < p{S) [25, Cor. 2.2.22]; which completes 
the proof. ■ 
Appendix D
Proof of Theorem 12

We may focus w.l.o.g. only on the case in which the channel
matrices $\{H_{qq}\}_{q \in \Omega}$ are full (row/column) rank matrices. The
case of rank deficient matrices can be cast in that of full
column-rank channel matrices, as proved in Appendix C.
Moreover, for the sake of notational simplicity, we consider
in the following only the case in which $\mathrm{rank}(H_{qq}) = n_{R_q}$
for all $q \in \Omega$ or $\mathrm{rank}(H_{qq}) = n_{T_q}$ for all $q \in \Omega$. The
case in which some channels $H_{qq}$ are full column-rank and
some others are full row-rank can be similarly addressed.
The proof consists in showing that, under (27) of Theorem
5 (if $\mathrm{rank}(H_{qq}) = n_{R_q}$, $\forall q \in \Omega$) or (50) of Theorem 7 (if
$\mathrm{rank}(H_{qq}) = n_{T_q}$, $\forall q \in \Omega$), the conditions of the asynchronous
convergence theorem in [23, Prop. 2.1] are satisfied by the
asynchronous MIMO IWFA described in Algorithm 1. The
asynchronous convergence theorem applied to Algorithm 1 can
be restated as follows.

Footnote (to the reduction to Appendix C): It is straightforward to see that conditions for the convergence of the
asynchronous MIMO IWFA applied to the lower-dimensional game defined in (126) guarantee
also convergence of the asynchronous MIMO IWFA applied to the original
game $\mathcal{G}$. Observe that, since the asynchronous IWFA applied to game $\mathcal{G}$ is
allowed to start from any arbitrary point $Q^{(0)}$ in $\mathcal{Q}$, it may happen that $Q^{(0)}$
does not belong to the class of matrices defined in (124). However, all the
points produced by the algorithm after the first iteration by the users
with rank-deficient direct channels, as well as the Nash equilibria of $\mathcal{G}$, are confined in the class (124).



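Theorem 18 (Asynchronous convergence theorem): Given
the waterfilling mapping $\mathrm{WF} = (\mathrm{WF}_q)_{q \in \Omega} : \mathcal{Q} \mapsto \mathcal{Q}$, with
$\mathrm{WF}_q(\cdot)$ and $\mathcal{Q}_q$ as defined earlier in the paper, assume
that there exists a sequence of nonempty sets $\{\mathcal{X}(k)\}_k$ with

$$\ldots \subset \mathcal{X}(k + 1) \subset \mathcal{X}(k) \subset \ldots \subset \mathcal{Q}, \qquad (130)$$

satisfying the next two conditions.

C.1 (Synchronous Convergence Condition)

$$\mathrm{WF}(Q) \in \mathcal{X}(k + 1), \qquad \forall k \text{ and } Q \in \mathcal{X}(k). \qquad (131)$$

Furthermore, if $\{Q^{(k)}\}$ is a sequence such that
$Q^{(k)} \in \mathcal{X}(k)$, for every $k$, then every limit point of
$\{Q^{(k)}\}$ is a fixed point of $\mathrm{WF}(\cdot)$.

C.2 (Box Condition) For every $k$ there exist sets $\mathcal{X}_q(k) \subset \mathcal{Q}_q$
such that

$$\mathcal{X}(k) = \mathcal{X}_1(k) \times \ldots \times \mathcal{X}_Q(k). \qquad (132)$$

Then, every limit point of $\{Q^{(k)}\}$ generated by the
asynchronous MIMO IWFA in Algorithm 1, starting from $Q^{(0)} \in \mathcal{X}(0)$,
is a fixed point of $\mathrm{WF}(\cdot)$. □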

Assume that condition (27) in Theorem 5 (if $\mathrm{rank}(H_{qq}) = n_{R_q}$,
$\forall q \in \Omega$) or condition (50) in Theorem 7 (if $\mathrm{rank}(H_{qq}) = n_{T_q}$,
$\forall q \in \Omega$) is satisfied, i.e.,



/3-/3(w,S(^*))^||S(^*) 



<1, 



(133) 



for some $w > 0$, implying that the waterfilling mapping WF
admits a unique fixed point, denoted by $Q^{\star}$ (cf. Theorem 9).
Given 



,(0) 



with II • ||?;,bi„,, 

in Theorem [18 



max ||Q(") 

Q(0)eJ2 



Q 



* II w 
F.block 



< +00, (134) 



given in (fTSI l. let define the candidate set X{k) 



las 



X{k) = {q G ^ : HQ - Q1|^,„„,, < eSL} C ^, 

(135) 

which is equal to the Cartesian product X{k) — Y['^=i ^q{k), 
where 

Xq{k) ^ [Qq e J3q I 1 1 Q, - 1 1 ^ < /J^' G^L } C . 

(136) 

This implies box condition C.2. Observe that, since $\beta < 1$, set
$\mathcal{X}(k)$ in (135) satisfies also condition (130). Synchronous
convergence condition C.1 follows from the contraction property
of the waterfilling mapping as proved in (26) (cf. Theorem 5)
under $\beta < 1$:



Q 



(fe+i) 



WF(Q('=') -Q* 



F.block . - - - ^block 

< p _ q* < /3'=+ieSL- 

F,block 

(137) 

This implies $Q^{(k+1)} \in \mathcal{X}(k + 1)$ for all $k \geq 0$, whenever
$Q^{(k)} \in \mathcal{X}(k)$, as required in (131). Finally, the convergence
to the unique fixed point of $\mathrm{WF}(\cdot)$ of the simultaneous MIMO
IWFA given by $Q^{(k)} \in \mathcal{X}(k)$ for all $k$, with initial conditions
$Q^{(0)} \in \mathcal{X}(0) = \mathcal{Q}$, follows from the contraction property of
the waterfilling mapping [23, Prop. 1.1(b)].

Therefore, under (133), the asynchronous MIMO IWFA
given in Algorithm 1 converges to the unique NE of game $\mathcal{G}$
for any set of initial conditions in $\mathcal{Q}$ and updating schedule.
The equivalence between (133) and (C1) has already been
proved in Appendix C. ■
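
The mechanism behind the asynchronous convergence theorem can be illustrated on a toy problem. The Python/NumPy sketch below does not implement the MIMO IWFA: as a stand-in for the waterfilling mapping it uses a hypothetical affine map $x \mapsto Ax + b$ with $\|A\|_\infty = 0.6 < 1$ (a contraction in the maximum norm), and it lets each "user" update its own coordinate at random times using possibly outdated values of the other coordinates. The update probability, the delay bound, and all names are assumptions made for the example.

```python
import numpy as np

def async_iterate(a, b, steps=600, max_delay=3, seed=0):
    """Asynchronous updates x_q <- (A x_stale + b)_q with bounded random delays."""
    rng = np.random.default_rng(seed)
    n = b.size
    history = [np.zeros(n)]                       # history[k] is the iterate at step k
    for _ in range(steps):
        x_new = history[-1].copy()
        for q in range(n):
            if rng.random() < 0.5:                # user q updates at this step
                # Each coordinate read by user q may be outdated by its own delay.
                delays = rng.integers(0, max_delay + 1, size=n)
                stale = np.array([history[max(len(history) - 1 - dl, 0)][r]
                                  for r, dl in enumerate(delays)])
                x_new[q] = a[q] @ stale + b[q]
        history.append(x_new)
    return history[-1]

rng0 = np.random.default_rng(1)
n = 4
a = rng0.uniform(-1.0, 1.0, (n, n))
np.fill_diagonal(a, 0.0)                          # no self-coupling, as for interference
a *= 0.6 / np.abs(a).sum(axis=1, keepdims=True)   # rescale rows: infinity norm 0.6
b = rng0.uniform(-1.0, 1.0, n)

x_async = async_iterate(a, b)
x_star = np.linalg.solve(np.eye(n) - a, b)        # unique fixed point of x = A x + b
print(np.max(np.abs(x_async - x_star)))
```

In this toy setting the nested sets of Theorem 18 are max-norm balls around the fixed point whose radii shrink by the contraction factor, which is the same construction used for $\mathcal{X}(k)$ in (135).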